

CANADIAN
JOURNAL OF MATHEMATICS


Journal Canadien de Mathématiques 


VOL. XI, NO. 3
1959 


Sur les représentations unitaires des groupes de
Lie nilpotents. IV Jacques Dixmier


Note on generalized Witt algebras Rimhak Ree 
Supersoluble immersion Reinhold Baer 


Nonlinear recursive sequences Elbert A. Walker


Extremal properties of Hermitian matrices. II 
M. Marcus, B. N. Moyls, and R. Westwick 


Linear transformations on algebras of matrices: 
the invariance of the elementary symmetric functions 
Marvin Marcus and Roger Purves 


Prime dual ideals in Boolean algebras L. J. Heider 
On a paper of Maurice Sion Mark Mahowald 
On generalized Morse-Transue function spaces H. W. Ellis 
On a type problem James A. Jenkins 


On some properties of functions analytic in a 
half-plane P. G. Rooney 


A network-flow feasibility theorem and 
combinatorial applications D. R. Fulkerson 


The regular maps on a surface of genus three F. A. Sherk 


Published for 
THE CANADIAN MATHEMATICAL CONGRESS 
by the 
University of Toronto Press 





EDITORIAL BOARD 


H. S. M. Coxeter, G. F. D. Duff, R. D. James, R. L. Jeffery, 
J.-M. Maranda, G. de B. Robinson, H. Zassenhaus


with the co-operation of 


A. D. Alexandrov, R. Brauer, W. P. Brown, D. B. DeLury, J. Dixmier, 
P. Hall, N. S. Mendelsohn, P. Scherk, J. L. Synge, A. W. Tucker, 
W. J. Webber, M. Wyman 


The chief languages of the Journal are English and French. 


Manuscripts for publication in the Journal should be sent to the 
Editor-in-Chief, G. F. D. Duff, University of Toronto. Authors are 
asked to write with a sense of perspective and as clearly as possible, 
especially in the introduction. Regarding typographical conventions, 
attention is drawn to the Author’s Manual of which a copy will be 
furnished on request. 


All other correspondence should be addressed to the Managing 
Editor, G. de B. Robinson, University of Toronto. 


The Journal is published quarterly. Subscriptions should be sent 
to the Managing Editor. The price per volume of four numbers 
is $8.00. This is reduced to $4.00 for individual members of recognized 
Mathematical Societies. 


The Canadian Mathematical Congress gratefully acknowledges the 
assistance of the following towards the cost of publishing this Journal: 


University of Alberta Assumption University 
University of British Columbia Carleton College 
Dalhousie University Ecole Polytechnique 
Université Laval Loyola College 
University of Manitoba McGill University 
McMaster University Université de Montréal 
Queen’s University Royal Military College 
St. Mary’s University University of Toronto 

National Research Council of Canada 

and the 
American Mathematical Society 


AUTHORIZED AS SECOND CLASS MAIL, POST OFFICE DEPARTMENT, OTTAWA 








SUR LES REPRESENTATIONS UNITAIRES DES 
GROUPES DE LIE NILPOTENTS. IV 


JACQUES DIXMIER 


Let $n$ be an integer $> 1$. We denote by $M_n$ the set of square matrices of order $n$ with real entries, and by $G_n$ the group of the $x = (\xi_{jk}) \in M_n$ such that $\xi_{jk} = 0$ for $1 \le j < k \le n$ and $\xi_{jj} = 1$ for $1 \le j \le n$. The group $G_n$ is a simply connected nilpotent Lie group, whose Lie algebra is identified with the set $\mathfrak{g}_n$ of the $x = (\xi_{jk}) \in M_n$ such that $\xi_{jk} = 0$ for $1 \le j \le k \le n$. We shall determine:

(1°) the centre of the enveloping algebra of $\mathfrak{g}_n$;

(2°) the 'principal' series of irreducible unitary representations of $G_n$ (finding all the irreducible unitary representations of $G_n$ does not seem easy);

(3°) the Plancherel formula for $G_n$;

(4°) the global characters (in the sense of (5)) of the representations of the principal series.

For $n$ odd, the study is slightly more complicated than for $n$ even. The proofs will be given in detail for $n$ even. For $n$ odd, we shall dwell only on the differences in the calculations.

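The following notations will be used. The matrix $(\xi_{jk}) \in M_n$ such that $\xi_{rs} = 1$ and $\xi_{jk} = 0$ for $j \ne r$ or $k \ne s$ will be denoted $e_{rs}$. The set of matrices $(\xi_{jk}) \in M_n$ such that $\xi_{jk} = 0$ for $j + k \ne n + 1$ will be denoted $E_n$. For every matrix $x = (\xi_{jk}) \in M_n$, we put:

\[
\Delta_1(x) = \begin{vmatrix} \xi_{11} & \xi_{12} & \cdots & \xi_{1n} \\ \xi_{21} & \xi_{22} & \cdots & \xi_{2n} \\ \cdots & \cdots & \cdots & \cdots \\ \xi_{n1} & \xi_{n2} & \cdots & \xi_{nn} \end{vmatrix}, \qquad
\Delta_2(x) = \begin{vmatrix} \xi_{12} & \xi_{13} & \cdots & \xi_{1n} \\ \xi_{22} & \xi_{23} & \cdots & \xi_{2n} \\ \cdots & \cdots & \cdots & \cdots \\ \xi_{n-1,2} & \xi_{n-1,3} & \cdots & \xi_{n-1,n} \end{vmatrix}, \qquad \ldots, \qquad
\Delta_n(x) = \xi_{1n}, \qquad \Delta_{n+1}(x) = 1.
\]

Thus $\Delta_k(x)$ is the determinant of the upper right corner block $(\xi_{jl})_{1 \le j \le n-k+1,\ k \le l \le n}$ of $x$.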
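As an illustration that is not part of the original text, the corner minors can be written out for $n = 3$. The sketch below assumes the reading $\Delta_k(x) = \det(\xi_{jl})_{1 \le j \le n-k+1,\ k \le l \le n}$, which is the one used later in the proofs of Lemmas 1 and 2.

```latex
% Illustration only: corner minors for n = 3, assuming
% Delta_k(x) = det( xi_{jl} ), 1 <= j <= n-k+1, k <= l <= n.
\begin{align*}
\Delta_1(x) &= \det x, \\
\Delta_2(x) &= \begin{vmatrix} \xi_{12} & \xi_{13} \\ \xi_{22} & \xi_{23} \end{vmatrix}
             = \xi_{12}\xi_{23} - \xi_{13}\xi_{22}, \\
\Delta_3(x) &= \xi_{13}, \\
\Delta_4(x) &= 1 .
\end{align*}
```

Each $\Delta_k$ is a minor of order $n - k + 1$ sitting in the upper right corner of $x$, so the orders decrease from $n$ down to $0$.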


Received August 15, 1958.













The enveloping algebra of a Lie algebra $\mathfrak{g}$ will be denoted $U(\mathfrak{g})$. The symmetric algebra of a vector space $V$ will be denoted $S(V)$. On the group $G_n$, the measure defined by the differential form $\prod_{1 \le k < j \le n} d\xi_{jk}$, which is a Haar measure, will be called the canonical Haar measure, and will be the only one used. Similarly, on the additive group of matrices $(\eta_{jk})$ with $n$ rows and $n'$ columns, the only measure used will be the measure $\prod_{1 \le j \le n,\ 1 \le k \le n'} d\eta_{jk}$.

Lemmas 1 and 3 are found in (3, pp. 8-10 and 12-14). However, since the situation here is slightly different in certain respects, the calculations have been written out for the convenience of the reader.


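1. The case where n is even. Centre of $U(\mathfrak{g}_n)$. We put $n = 2m$. Every $x \in G_n$ can be written in the form

\[ x = \begin{pmatrix} y & 0 \\ w & z \end{pmatrix} \]

where $y \in G_m$, $z \in G_m$, $w \in M_m$. If

\[ x' = \begin{pmatrix} y' & 0 \\ w' & z' \end{pmatrix}, \]

we have

\[ xx' = \begin{pmatrix} yy' & 0 \\ wy' + zw' & zz' \end{pmatrix}, \]

whence easily

\[ x^{-1} = \begin{pmatrix} y^{-1} & 0 \\ -z^{-1}wy^{-1} & z^{-1} \end{pmatrix}, \qquad xx'x^{-1} = \begin{pmatrix} yy'y^{-1} & 0 \\ t & zz'z^{-1} \end{pmatrix}, \]

with $t = (wy' + zw' - zz'z^{-1}w)y^{-1}$. We see that the set $A_{2m}$ of the $x \in G_{2m}$ such that $y = z = 1$ is an abelian normal subgroup of $G_{2m}$. The abelian ideal $\mathfrak{a}_{2m}$ of $\mathfrak{g}_{2m}$ corresponding to $A_{2m}$ is the set of matrices

\[ \begin{pmatrix} 0 & 0 \\ w & 0 \end{pmatrix} \]

where $w \in M_m$.


LEMMA 1. Let $N_m$ be the set of the $w \in M_m$ such that $\Delta_2(w)\Delta_3(w) \cdots \Delta_m(w) \ne 0$. If $w \in N_m$, there exist unique elements $y \in G_m$, $z \in G_m$, $e \in E_m$ such that $w = zey$. If $e = (\varepsilon_{jk})$, we have

\[ (1) \qquad \varepsilon_{m-j+1,j} = (-1)^{m-j}\, \frac{\Delta_j(w)}{\Delta_{j+1}(w)}. \]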


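Proof. Putting $z^{-1} = z'$, it amounts to the same to prove that there exist unique elements $y \in G_m$, $z' \in G_m$, $e \in E_m$ such that $z'w = ey$. Let $w = (\omega_{jk})$, $y = (\eta_{jk})$, $z' = (\zeta_{jk})$, $e = (\varepsilon_{jk})$. We must have:

\[ (2) \qquad \sum_{r=1}^{j} \zeta_{jr}\,\omega_{rk} = 0 \qquad (j + k > m + 1) \]
\[ (3) \qquad \sum_{r=1}^{j} \zeta_{jr}\,\omega_{r,m-j+1} = \varepsilon_{j,m-j+1} \]
\[ (4) \qquad \sum_{r=1}^{j} \zeta_{jr}\,\omega_{rk} = \varepsilon_{j,m-j+1}\,\eta_{m-j+1,k} \qquad (j + k < m + 1). \]

For fixed $j$, the equations (2), which can be written

\[ \sum_{r=1}^{j-1} \zeta_{jr}\,\omega_{rk} = -\omega_{jk} \qquad (k = m-j+2,\ m-j+3,\ \ldots,\ m), \]

form a system of $j - 1$ equations in $j - 1$ unknowns, whose determinant is $\Delta_{m-j+2}(w)$. This determinant is nonzero since $w \in N_m$. Hence the existence and uniqueness of the $\zeta_{jk}$ satisfying (2). The equations (3) then give the $\varepsilon_{j,m-j+1}$. Moreover, eliminating $\zeta_{j1}, \ldots, \zeta_{j,j-1}$ between the $j$ equations (2)-(3) which contain these unknowns, we obtain

\[ \begin{vmatrix} \omega_{1,m-j+1} & \omega_{1,m-j+2} & \cdots & \omega_{1m} \\ \omega_{2,m-j+1} & \omega_{2,m-j+2} & \cdots & \omega_{2m} \\ \cdots & \cdots & \cdots & \cdots \\ \omega_{j,m-j+1} - \varepsilon_{j,m-j+1} & \omega_{j,m-j+2} & \cdots & \omega_{jm} \end{vmatrix} = 0. \]

Hence the formulas (1).
Since $w \in N_m$, we see that $\varepsilon_{1m} \ne 0$, $\varepsilon_{2,m-1} \ne 0$, ..., $\varepsilon_{m-1,2} \ne 0$. The formulas (4) then prove the existence and uniqueness of the $\eta_{jk}$.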
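The following worked case is an illustration added here, not taken from the original: it checks Lemma 1 and formula (1) by hand for $m = 2$. The symbols $\zeta$ and $\eta$ stand for the single free entries $\zeta_{21}$ of $z$ and $\eta_{21}$ of $y$.

```latex
% Illustration only (m = 2): the decomposition w = z e y written out.
% zeta = zeta_{21}, eta = eta_{21} are the only free entries of z and y.
\begin{align*}
w = z e y
  &= \begin{pmatrix} 1 & 0 \\ \zeta & 1 \end{pmatrix}
     \begin{pmatrix} 0 & \varepsilon_{12} \\ \varepsilon_{21} & 0 \end{pmatrix}
     \begin{pmatrix} 1 & 0 \\ \eta & 1 \end{pmatrix}
   = \begin{pmatrix} \varepsilon_{12}\eta & \varepsilon_{12} \\
                     \varepsilon_{21} + \zeta\varepsilon_{12}\eta & \zeta\varepsilon_{12}
     \end{pmatrix}, \\
\Delta_2(w) &= \omega_{12} = \varepsilon_{12}, \qquad
\Delta_1(w) = \det w = -\varepsilon_{12}\varepsilon_{21}, \qquad \Delta_3(w) = 1, \\
\varepsilon_{12} &= \frac{\Delta_2(w)}{\Delta_3(w)}, \qquad
\varepsilon_{21} = -\frac{\Delta_1(w)}{\Delta_2(w)}
   \qquad \text{(formula (1) with } j = 2 \text{ and } j = 1\text{)}, \\
\eta &= \frac{\omega_{11}}{\omega_{12}}, \qquad
\zeta = \frac{\omega_{22}}{\omega_{12}} .
\end{align*}
```

The only condition needed is $\omega_{12} = \Delta_2(w) \ne 0$, which is exactly the definition of $N_2$.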


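LEMMA 2. Let $w = (\omega_{jk}) \to f(w)$ be a polynomial function on $M_m$. In order that $f(zwy) = f(w)$ for all $w \in M_m$, $y \in G_m$, $z \in G_m$, it is necessary and sufficient that $f$ belong to the algebra generated by the functions $\Delta_1, \Delta_2, \ldots, \Delta_m$.


Proof. Putting $w = (\omega_{jk})$, $y = (\eta_{jk})$, $z = (\zeta_{jk})$, $zwy = (\omega'_{jk})$, we have

\[ \omega'_{jk} = \sum_{1 \le r \le m,\ 1 \le s \le m} \zeta_{jr}\,\omega_{rs}\,\eta_{sk} = \sum_{1 \le r \le j,\ k \le s \le m} \zeta_{jr}\,\omega_{rs}\,\eta_{sk}. \]

Hence

\[ (\omega'_{jk})_{1 \le j \le t,\ m-t+1 \le k \le m} = (\zeta_{jk})_{1 \le j \le t,\ 1 \le k \le t}\ (\omega_{jk})_{1 \le j \le t,\ m-t+1 \le k \le m}\ (\eta_{jk})_{m-t+1 \le j \le m,\ m-t+1 \le k \le m} \]

and consequently

\[ \det(\omega'_{jk})_{1 \le j \le t,\ m-t+1 \le k \le m} = \det(\omega_{jk})_{1 \le j \le t,\ m-t+1 \le k \le m}. \]

This proves that the condition of the statement is sufficient.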
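To make the sufficiency argument concrete, here is the case $m = 2$ computed directly. This is an added illustration, not part of the original proof; $\zeta$ and $\eta$ denote the entries $\zeta_{21}$ and $\eta_{21}$.

```latex
% Illustration only (m = 2): invariance of Delta_1 and Delta_2 under w -> z w y.
\begin{align*}
zwy &= \begin{pmatrix} 1 & 0 \\ \zeta & 1 \end{pmatrix}
       \begin{pmatrix} \omega_{11} & \omega_{12} \\ \omega_{21} & \omega_{22} \end{pmatrix}
       \begin{pmatrix} 1 & 0 \\ \eta & 1 \end{pmatrix} \\
    &= \begin{pmatrix}
         \omega_{11} + \omega_{12}\eta & \omega_{12} \\
         \omega_{21} + \omega_{22}\eta + \zeta(\omega_{11} + \omega_{12}\eta)
           & \omega_{22} + \zeta\omega_{12}
       \end{pmatrix}, \\
\Delta_2(zwy) &= \omega_{12} = \Delta_2(w), \\
\Delta_1(zwy) &= \det z \,\det w\, \det y = \det w = \Delta_1(w).
\end{align*}
```

Left multiplication by $z$ adds multiples of upper rows to lower rows, and right multiplication by $y$ adds multiples of right-hand columns to left-hand columns, which is why only upper right corner minors survive.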

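Now let $w = (\omega_{jk}) \to f(w) = f((\omega_{jk}))$ be a polynomial such that $f(zwy) = f(w)$ for all $w \in M_m$, $y \in G_m$, $z \in G_m$. If the $\omega_{jk}$ such that $j + k \ne m + 1$ are replaced by 0 in $f((\omega_{jk}))$, we obtain a polynomial in $\omega_{m1}, \omega_{m-1,2}, \ldots, \omega_{1m}$, which we denote $g(\omega_{m1}, \ldots, \omega_{1m})$. Let us keep the notation $N_m$ of Lemma 1. If $w \in N_m$, there exist $y \in G_m$, $z \in G_m$, $e = (\varepsilon_{jk}) \in E_m$ such that $w = zey$. We have $f(w) = f(e) = g(\varepsilon_{m1}, \ldots, \varepsilon_{1m})$. By the formulas (1),

\[ (5) \qquad f((\omega_{jk})) = g\left( (-1)^{m+1}\frac{\Delta_1(w)}{\Delta_2(w)},\ \ldots,\ -\frac{\Delta_{m-1}(w)}{\Delta_m(w)},\ \Delta_m(w) \right). \]

Consider now the matrices $w$ of the form

\[ \begin{pmatrix} \omega_{11} & \omega_{12} & \cdots & \omega_{1,m-1} & \omega_{1m} \\ 0 & 0 & \cdots & 0 & 1 \\ 0 & 0 & \cdots & 1 & 0 \\ \cdots & \cdots & \cdots & \cdots & \cdots \\ 0 & 1 & \cdots & 0 & 0 \end{pmatrix}. \]

When $f$ is restricted to the set of these matrices, we obtain a polynomial $h(\omega_{11}, \ldots, \omega_{1m})$, and formula (5) becomes

\[ (6) \qquad h(\omega_{11}, \ldots, \omega_{1m}) = g\left( \pm\frac{\omega_{11}}{\omega_{12}},\ \ldots,\ \pm\frac{\omega_{1,m-1}}{\omega_{1m}},\ \omega_{1m} \right), \]

valid for $\omega_{12} \ne 0$, $\omega_{13} \ne 0$, ..., $\omega_{1m} \ne 0$. The equalities (5) and (6) imply

\[ f((\omega_{jk})) = h(\pm\Delta_1(w),\ \ldots,\ \pm\Delta_{m-1}(w),\ \Delta_m(w)), \]

an equality valid for $w \in N_m$ and consequently for every $w \in M_m$, by the principle of the irrelevance of algebraic inequalities. Hence the lemma.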


THEOREM 1. The centre of $U(\mathfrak{g}_{2m})$ is generated by the algebraically independent elements

\[ e_{2m,1}, \qquad \begin{vmatrix} e_{2m-1,1} & e_{2m-1,2} \\ e_{2m,1} & e_{2m,2} \end{vmatrix}, \qquad \ldots, \qquad \begin{vmatrix} e_{m+1,1} & \cdots & e_{m+1,m} \\ \cdots & \cdots & \cdots \\ e_{2m,1} & \cdots & e_{2m,m} \end{vmatrix}.^{*} \]


Proof. We shall first look for the elements of $S(\mathfrak{g}_{2m})$ invariant under the adjoint representation of $\mathfrak{g}_{2m}$. An element of $S(\mathfrak{g}_{2m})$ is of the form $f((e_{jk})_{j>k})$, where $f$ is a polynomial in $\tfrac{1}{2}n(n-1)$ variables with real coefficients. The only nonzero brackets of the $e_{jk}$ with one another are given by the formulas

\[ [e_{jk}, e_{kl}] = -[e_{kl}, e_{jk}] = e_{jl} \qquad (j > k > l). \]

The condition that $f((e_{jk}))$ be invariant under the adjoint representation is expressed by the equalities

*The $e_{jk}$ which appear in these determinants belong to $\mathfrak{a}_{2m}$, hence commute pairwise; thus there is no ambiguity about the meaning of these determinants.








\[ \sum_{j>k} [e_{rs}, e_{jk}]\, f'_{e_{jk}} = 0 \qquad (r > s), \]

that is,

\[ (7) \qquad \sum_{k<s} e_{rk}\, f'_{e_{sk}} - \sum_{j>r} e_{js}\, f'_{e_{jr}} = 0 \qquad (r > s). \]

This equality reduces to $0 = 0$ for $r = n$ and $s = 1$. For $r = n - 1$, $s = 1$, it gives

\[ e_{n1}\, f'_{e_{n,n-1}} = 0, \]

so that $f$ is independent of $e_{n,n-1}$. For $r = n$, $s = 2$, it gives

\[ e_{n1}\, f'_{e_{21}} = 0, \]

so that $f$ is independent of $e_{21}$. Let $p$ be an integer $< m$, and suppose it has been proved that $f$ is independent of the $e_{jk}$ for $j \le p$ on the one hand, and for $k \ge n - p + 1$ on the other. Let us write condition (7) for $r = n - p$, $s = 1, 2, \ldots, p$ (which is possible since $p < n - p$). We obtain

\[ \sum_{j>n-p} e_{j1}\, f'_{e_{j,n-p}} = 0 \]
\[ \sum_{j>n-p} e_{j2}\, f'_{e_{j,n-p}} = e_{n-p,1}\, f'_{e_{21}} \]
\[ \cdots \]
\[ \sum_{j>n-p} e_{jp}\, f'_{e_{j,n-p}} = e_{n-p,1}\, f'_{e_{p1}} + e_{n-p,2}\, f'_{e_{p2}} + \cdots + e_{n-p,p-1}\, f'_{e_{p,p-1}}. \]

By the induction hypothesis, the right-hand sides are zero. These equalities then imply that

\[ f'_{e_{j,n-p}} = 0 \]

for $j > n - p$, that is, $f$ is independent of the $e_{jk}$ for $k = n - p$. Writing now condition (7) for $s = p + 1$, $r = n - p + 1, n - p + 2, \ldots, n$ (which is possible since $n - p + 1 > p + 1$), we find in the same way that $f$ is independent of the $e_{jk}$ for $j = p + 1$.

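Thus $f$ is independent of the $e_{jk}$ for $j \le m$ on the one hand, and for $k \ge m + 1$ on the other, so that $f \in S(\mathfrak{a}_{2m})$. Let us therefore look for the elements of $S(\mathfrak{a}_{2m})$ invariant under the adjoint representation of $\mathfrak{g}_{2m}$ or, what amounts to the same, under the adjoint representation $\rho$ of $G_{2m}$. Let us identify $\mathfrak{a}_{2m}$ with $M_m$ by the map

\[ \begin{pmatrix} 0 & 0 \\ w & 0 \end{pmatrix} \to w. \]

Then, if

\[ x = \begin{pmatrix} y & 0 \\ w' & z \end{pmatrix} \in G_{2m}, \]

we have

\[ \rho(x).w = \rho(x).\begin{pmatrix} 0 & 0 \\ w & 0 \end{pmatrix} = \begin{pmatrix} y & 0 \\ w' & z \end{pmatrix} \begin{pmatrix} 0 & 0 \\ w & 0 \end{pmatrix} \begin{pmatrix} y & 0 \\ w' & z \end{pmatrix}^{-1} = \begin{pmatrix} 0 & 0 \\ zwy^{-1} & 0 \end{pmatrix} = zwy^{-1}. \]

Let $A_{y,z}$ denote the automorphism of the algebra $S(M_m)$ which extends the automorphism $w \to zwy^{-1}$ of the vector space $M_m$. The problem is therefore to find the elements of $S(M_m)$ which are invariant under the automorphisms $A_{y,z}$.

By means of the bilinear form $(w, w') \to \mathrm{tr}(ww')$ on $M_m$, we identify the vector space $M_m$ with its dual. In this identification, $e_{jk}$ is identified with the linear form $(\omega_{jk}) \to \omega_{kj}$ on $M_m$. Then $S(M_m)$ is identified with the algebra of polynomial functions on $M_m$: to the element

\[ e_{j_1k_1} \cdots e_{j_pk_p} \]

of $S(M_m)$ corresponds the polynomial function

\[ (\omega_{jk}) \to \omega_{k_1j_1} \cdots \omega_{k_pj_p}. \]

For $y \in G_m$, $z \in G_m$, we have

\[ \mathrm{tr}(w(A_{y,z}w')) = \mathrm{tr}(wzw'y^{-1}) = \mathrm{tr}(y^{-1}wzw') = \mathrm{tr}((A_{z^{-1},y^{-1}}w)w'). \]

Hence the transpose of $A_{y,z}$ is identified with $A_{z^{-1},y^{-1}}$. Then, by the elementary properties of symmetric algebras, in order that an element of $S(M_m)$ be invariant under the $A_{y,z}$, it is necessary and sufficient that the corresponding polynomial function be invariant under the $A_{y,z}$, that is, under the map $w \to zwy^{-1}$ of $M_m$ onto $M_m$. Hence (Lemma 2) the elements of $S(M_m)$ invariant under the $A_{y,z}$ constitute the algebra generated by the elements

\[ e_{m1}, \qquad \begin{vmatrix} e_{m-1,1} & e_{m-1,2} \\ e_{m1} & e_{m2} \end{vmatrix}, \qquad \ldots, \qquad \begin{vmatrix} e_{11} & \cdots & e_{1m} \\ \cdots & \cdots & \cdots \\ e_{m1} & \cdots & e_{mm} \end{vmatrix}. \]

Taking into account the identification adopted between $\mathfrak{a}_{2m}$ and $M_m$, we see that the elements of $S(\mathfrak{g}_{2m})$ invariant under the adjoint representation constitute the algebra generated by the elements

\[ e_{2m,1}, \qquad \begin{vmatrix} e_{2m-1,1} & e_{2m-1,2} \\ e_{2m,1} & e_{2m,2} \end{vmatrix}, \qquad \ldots, \qquad \begin{vmatrix} e_{m+1,1} & \cdots & e_{m+1,m} \\ \cdots & \cdots & \cdots \\ e_{2m,1} & \cdots & e_{2m,m} \end{vmatrix}. \]

Finally, the centre of $U(\mathfrak{g}_{2m})$ is the image, under the canonical map $\varphi$ of $S(\mathfrak{g}_{2m})$ onto $U(\mathfrak{g}_{2m})$, of the preceding algebra. Since $\mathfrak{a}_{2m}$ is abelian, the restriction of $\varphi$ to $S(\mathfrak{a}_{2m})$ is an isomorphism of $S(\mathfrak{a}_{2m})$ onto $U(\mathfrak{a}_{2m}) \subset U(\mathfrak{g}_{2m})$. Hence the theorem.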
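As a sanity check that is not in the original, Theorem 1 can be verified by hand for $n = 4$ ($m = 2$), where the two generators are $e_{41}$ and $D = e_{31}e_{42} - e_{32}e_{41}$. The sketch uses only the matrix-unit rule $[e_{rs}, e_{jk}] = \delta_{sj}e_{rk} - \delta_{kr}e_{js}$ and the fact that $e_{31}, e_{32}, e_{41}, e_{42}$ commute pairwise; the name $D$ is introduced here for convenience.

```latex
% Illustration only (n = 4, m = 2): the generators e_{41} and
% D = e_{31} e_{42} - e_{32} e_{41} commute with e_{21}, e_{32}, e_{43}.
% Rule used: [e_{rs}, e_{jk}] = \delta_{sj} e_{rk} - \delta_{kr} e_{js}.
\begin{align*}
[e_{21}, e_{42}] &= -e_{41}, & [e_{21}, e_{32}] &= -e_{31}, &
[e_{21}, e_{31}] &= [e_{21}, e_{41}] = 0, \\
[e_{43}, e_{31}] &= e_{41}, & [e_{43}, e_{32}] &= e_{42}, &
[e_{43}, e_{41}] &= [e_{43}, e_{42}] = 0, \\
[e_{32}, e_{31}] &= [e_{32}, e_{41}] = [e_{32}, e_{42}] = 0 . && &&
\end{align*}
\begin{align*}
[e_{21}, D] &= e_{31}[e_{21}, e_{42}] - [e_{21}, e_{32}]\,e_{41}
             = -e_{31}e_{41} + e_{31}e_{41} = 0, \\
[e_{43}, D] &= [e_{43}, e_{31}]\,e_{42} - [e_{43}, e_{32}]\,e_{41}
             = e_{41}e_{42} - e_{42}e_{41} = 0, \\
[e_{32}, D] &= 0 .
\end{align*}
```

Since $e_{21}, e_{32}, e_{43}$ generate $\mathfrak{g}_4$ as a Lie algebra, this confirms that $D$ is central; $e_{41}$ is central because no bracket $[e_{rs}, e_{41}]$ with $r > s$ can be nonzero.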


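2. The case where n is even. Plancherel formula. We keep the preceding notations. Every element $e \in E_m$ defines the linear form $w \to \mathrm{tr}(ew)$ on $M_m$, hence the linear form

\[ \begin{pmatrix} 0 & 0 \\ w & 0 \end{pmatrix} \to \mathrm{tr}(ew) \]

on $\mathfrak{a}_{2m}$. On the other hand, the exponential map of $\mathfrak{a}_{2m}$ onto $A_{2m}$, that is, the map

\[ \begin{pmatrix} 0 & 0 \\ w & 0 \end{pmatrix} \to \begin{pmatrix} 1 & 0 \\ w & 1 \end{pmatrix}, \]

is an isomorphism of the additive group $\mathfrak{a}_{2m}$ onto the abelian group $A_{2m}$, so that the map

\[ \begin{pmatrix} 1 & 0 \\ w & 1 \end{pmatrix} \to \exp i\,\mathrm{tr}(ew) = \exp i \sum_{j=1}^{m} \varepsilon_{j,m-j+1}\,\omega_{m-j+1,j} \]

(where $e = (\varepsilon_{jk})$ and $w = (\omega_{jk})$) is a character $\xi_e$ of $A_{2m}$. We denote by $U_e$ the unitary representation of $G_{2m}$ induced by $\xi_e$.
Every element of $G_{2m}$ can be written in a unique way in the form

\[ \begin{pmatrix} 1 & 0 \\ w & 1 \end{pmatrix} \begin{pmatrix} y & 0 \\ 0 & z \end{pmatrix} \]

with $y \in G_m$, $z \in G_m$, $w \in M_m$. Thus $G_{2m}$ is the semi-direct product of $A_{2m}$ and of a group canonically isomorphic to $G_m \times G_m$. Moreover

\[ (8) \qquad \begin{pmatrix} y' & 0 \\ 0 & z' \end{pmatrix} \begin{pmatrix} y & 0 \\ w & z \end{pmatrix} = \begin{pmatrix} y'y & 0 \\ z'w & z'z \end{pmatrix} = \begin{pmatrix} 1 & 0 \\ z'wy^{-1}y'^{-1} & 1 \end{pmatrix} \begin{pmatrix} y'y & 0 \\ 0 & z'z \end{pmatrix}. \]

Consequently, the Hilbert space on which $U_e$ acts is canonically identified with $L^2_{\mathbf{C}}(G_m \times G_m)$, $G_m \times G_m$ being equipped with its canonical Haar measure and $\mathbf{C}$ denoting the complex field; and, if $(y', z') \to f(y', z')$ is an element of $L^2_{\mathbf{C}}(G_m \times G_m)$, formula (8) proves that, for

\[ x = \begin{pmatrix} y & 0 \\ w & z \end{pmatrix} \in G_{2m}, \]

we have

\[ (U_e(x)f)(y', z') = f(y'y, z'z)\ \xi_e\begin{pmatrix} 1 & 0 \\ z'wy^{-1}y'^{-1} & 1 \end{pmatrix}, \]

or again

\[ (9) \qquad (U_e(x)f)(y', z') = f(y'y, z'z)\, \exp i\,\mathrm{tr}(ez'wy^{-1}y'^{-1}). \]

Formula (9) defines the representation $U_e$ explicitly.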
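The check below is an added illustration, not from the original: it confirms that formula (9) is multiplicative, using the product rule $xx' = \begin{pmatrix} yy' & 0 \\ wy' + zw' & zz' \end{pmatrix}$ from Section 1. The subscripts 1 and 2 label two arbitrary group elements $x_1 = (y_1, z_1, w_1)$ and $x_2 = (y_2, z_2, w_2)$, a notation chosen here.

```latex
% Illustration only: U_e(x_1) U_e(x_2) = U_e(x_1 x_2) for formula (9).
\begin{align*}
(U_e(x_1)U_e(x_2)f)(y', z')
  &= (U_e(x_2)f)(y'y_1, z'z_1)\,
     \exp i\,\mathrm{tr}\!\left(e z' w_1 y_1^{-1} y'^{-1}\right) \\
  &= f(y'y_1y_2,\ z'z_1z_2)\,
     \exp i\,\mathrm{tr}\!\left(e z' z_1 w_2 y_2^{-1} y_1^{-1} y'^{-1}
        + e z' w_1 y_1^{-1} y'^{-1}\right), \\
x_1x_2 &= \begin{pmatrix} y_1y_2 & 0 \\ w_1y_2 + z_1w_2 & z_1z_2 \end{pmatrix}, \\
(U_e(x_1x_2)f)(y', z')
  &= f(y'y_1y_2,\ z'z_1z_2)\,
     \exp i\,\mathrm{tr}\!\left(e z'(w_1y_2 + z_1w_2)(y_1y_2)^{-1} y'^{-1}\right).
\end{align*}
```

Expanding $(w_1y_2 + z_1w_2)\,y_2^{-1}y_1^{-1} = w_1y_1^{-1} + z_1w_2y_2^{-1}y_1^{-1}$ shows that the two exponents agree.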
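THEOREM 2. For $e = (\varepsilon_{jk}) \in E_m$, put $\varepsilon_1 = \varepsilon_{m1}$, $\varepsilon_2 = \varepsilon_{m-1,2}$, ..., $\varepsilon_m = \varepsilon_{1m}$.

(i) The representation $U_e$ admits the infinitesimal character $\chi_e$ (in the sense of (2)) defined by

\[ \chi_e(e_{2m,1}) = i\varepsilon_m, \qquad \chi_e\begin{vmatrix} e_{2m-1,1} & e_{2m-1,2} \\ e_{2m,1} & e_{2m,2} \end{vmatrix} = -i^2\,\varepsilon_{m-1}\varepsilon_m, \qquad \ldots, \qquad \chi_e\begin{vmatrix} e_{m+1,1} & \cdots & e_{m+1,m} \\ \cdots & \cdots & \cdots \\ e_{2m,1} & \cdots & e_{2m,m} \end{vmatrix} = (-1)^{m(m-1)/2}\, i^m\, \varepsilon_1\varepsilon_2 \cdots \varepsilon_m. \]

(ii) If one restricts oneself to the $e \in E_m$ such that $\varepsilon_2, \varepsilon_3, \ldots, \varepsilon_m \ne 0$, the representations $U_e$ are irreducible and pairwise inequivalent.

(iii) If $F$ is an integrable function on $G_{2m}$, we have

\[ \int_{G_{2m}} |F(x)|^2\, dx = (2\pi)^{-m^2} \int_{E_m} \mathrm{tr}(U_e(F)^*U_e(F))\ \varepsilon_2^2\varepsilon_3^4 \cdots \varepsilon_m^{2(m-1)}\, d\varepsilon_1\, d\varepsilon_2 \cdots d\varepsilon_m. \]


Proof. If

\[ x = \begin{pmatrix} 1 & 0 \\ w & 1 \end{pmatrix} \in A_{2m}, \]

formula (9) becomes

\[ (U_e(x)f)(y', z') = f(y', z')\, \exp i\,\mathrm{tr}(ez'wy'^{-1}). \]

Hence the differential operator corresponding to the element

\[ \begin{pmatrix} 0 & 0 \\ w & 0 \end{pmatrix} \]

of $\mathfrak{a}_{2m}$ is the operator of multiplication by the function

\[ (y', z') \to i\,\mathrm{tr}(ez'wy'^{-1}) = i\,\mathrm{tr}((y'^{-1}ez')w). \]

For fixed $y'$ and $z'$, the value of this function divided by $i$ is the linear form on $\mathfrak{a}_{2m}$ defined by $y'^{-1}ez'$. This linear form extends to a homomorphism $\psi$ of $S(\mathfrak{a}_{2m}) = U(\mathfrak{a}_{2m})$ into the complex field. The value of $\psi$ at an invariant element of $S(\mathfrak{a}_{2m})$ is independent of $y'$ and $z'$. Thus the differential operator corresponding to an element of the centre of $U(\mathfrak{g}_{2m})$ is a scalar operator. Hence the representation $U_e$ admits an infinitesimal character $\chi_e$, and we have

\[ \chi_e\begin{vmatrix} e_{2m-j+1,1} & \cdots & e_{2m-j+1,j} \\ \cdots & \cdots & \cdots \\ e_{2m,1} & \cdots & e_{2m,j} \end{vmatrix} = \begin{vmatrix} i\,\mathrm{tr}(e\,e_{m-j+1,1}) & \cdots & i\,\mathrm{tr}(e\,e_{m-j+1,j}) \\ \cdots & \cdots & \cdots \\ i\,\mathrm{tr}(e\,e_{m,1}) & \cdots & i\,\mathrm{tr}(e\,e_{m,j}) \end{vmatrix} \]
\[ = \begin{vmatrix} 0 & \cdots & 0 & i\varepsilon_{j,m-j+1} \\ 0 & \cdots & i\varepsilon_{j-1,m-j+2} & 0 \\ \cdots & \cdots & \cdots & \cdots \\ i\varepsilon_{1m} & \cdots & 0 & 0 \end{vmatrix} = (-1)^{j(j-1)/2}\, i^j\, \varepsilon_{m-j+1}\varepsilon_{m-j+2} \cdots \varepsilon_m. \]

This proves (i).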


Let

\[ x = \begin{pmatrix} y & 0 \\ w & z \end{pmatrix} \in G_{2m}; \]

then the inner automorphism of $G_{2m}$ corresponding to $x$ defines an automorphism of $A_{2m}$, hence of its dual. We shall show that, if $\varepsilon_2, \varepsilon_3, \ldots, \varepsilon_m \ne 0$, $x$ can leave $\xi_e$ fixed only if $x \in A_{2m}$. It will follow ((7); cf. also (1)) that $U_e$ is irreducible. Now, to say that $x$ leaves $\xi_e$ fixed means that $x$ leaves fixed the linear form $w' \to \mathrm{tr}(ew')$ on $M_m$. Since $\rho(x).w' = zw'y^{-1}$, this means again that $\mathrm{tr}(ew') = \mathrm{tr}(ezw'y^{-1}) = \mathrm{tr}(y^{-1}ezw')$ for every $w' \in M_m$, hence that $e = y^{-1}ez$, hence that $y = z = 1$ (Lemma 1). This establishes our assertion.

Let $e' = (\varepsilon'_{jk}) \in E_m$, with $\varepsilon'_2\varepsilon'_3 \cdots \varepsilon'_m \ne 0$ (where $\varepsilon'_j = \varepsilon'_{m-j+1,j}$). If $e \ne e'$, (i) proves that $\chi_e \ne \chi_{e'}$, hence that the representations $U_e$, $U_{e'}$ are inequivalent. Thus (ii) is proved.


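Let

\[ x = \begin{pmatrix} y & 0 \\ w & z \end{pmatrix} \to F(x) = F_1(y, z, w) \]

be an integrable function on $G_{2m}$. For $f, f' \in L^2_{\mathbf{C}}(G_m \times G_m)$, we have

\[ (U_e(F)f \mid f') = \int (U_e(x)f \mid f')\, F(x)\, dx = \int_{G_{2m}} F(x)\, dx \iint_{G_m \times G_m} f(y'y, z'z)\, \overline{f'(y', z')}\, \exp i\,\mathrm{tr}(ez'wy^{-1}y'^{-1})\, dy'\, dz'. \]

The function

\[ (x, y', z') \to F(x)\, f(y'y, z'z)\, \overline{f'(y', z')}\, \exp i\,\mathrm{tr}(ez'wy^{-1}y'^{-1}) \]

on $G_{2m} \times G_m \times G_m$ is measurable for $dx\, dy'\, dz'$, and

\[ \iiint^{*} |F(x)\, f(y'y, z'z)\, f'(y', z')\, \exp i\,\mathrm{tr}(ez'wy^{-1}y'^{-1})|\, dx\, dy'\, dz' = \int^{*} |F(x)|\, dx \iint^{*} |f(y'y, z'z)|\, |f'(y', z')|\, dy'\, dz' < +\infty. \]

We may therefore apply the Lebesgue-Fubini theorem, which gives

\[ (10) \qquad (U_e(F)f \mid f') = \int\!\!\int\!\!\int\!\!\int\!\!\int F_1(y, z, w)\, f(y'y, z'z)\, \overline{f'(y', z')}\, \exp i\,\mathrm{tr}(ez'wy^{-1}y'^{-1})\, dy\, dz\, dw\, dy'\, dz' \]
\[ = \int\!\!\int\!\!\int\!\!\int\!\!\int F_1(y'^{-1}y, z'^{-1}z, w)\, f(y, z)\, \overline{f'(y', z')}\, \exp i\,\mathrm{tr}(ez'wy^{-1})\, dy\, dz\, dw\, dy'\, dz' \]
\[ = \int\!\!\int\!\!\int\!\!\int f(y, z)\, \overline{f'(y', z')} \left[ \int F_1(y'^{-1}y, z'^{-1}z, w)\, \exp i\,\mathrm{tr}(y^{-1}ez'w)\, dw \right] dy\, dz\, dy'\, dz'. \]

Hence ((2), Lemma 35)

\[ \mathrm{tr}(U_e(F)^*U_e(F)) = \int\!\!\int\!\!\int\!\!\int \left| \int F_1(y'^{-1}y, z'^{-1}z, w)\, \exp i\,\mathrm{tr}(y^{-1}ez'w)\, dw \right|^2 dy\, dz\, dy'\, dz'. \]

Since we have canonically identified the vector space $M_m$ with its dual, the Fourier transform of $(y, z, w) \to F_1(y, z, w)$ with respect to the variable $w$ is again a function $(y, z, w) \to F_2(y, z, w)$ on $G_m \times G_m \times M_m$, and we have

\[ (11) \qquad \mathrm{tr}(U_e(F)^*U_e(F)) = \int\!\!\int\!\!\int\!\!\int |F_2(y'^{-1}y, z'^{-1}z, y^{-1}ez')|^2\, dy\, dz\, dy'\, dz' \]
\[ = \int\!\!\int\!\!\int\!\!\int |F_2(y, z, y^{-1}y'^{-1}ez')|^2\, dy\, dz\, dy'\, dz' \]
\[ = \int\!\!\int\!\!\int\!\!\int |F_2(y, z, y'ez')|^2\, dy\, dz\, dy'\, dz'. \]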













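To complete the proof, we shall need a lemma. Let us adopt the notation $N_m$ of Lemma 1. Let $F_m = E_m \cap N_m$, that is, the set of the $e = (\varepsilon_{jk}) \in M_m$ such that $\varepsilon_{jk} = 0$ for $j + k \ne m + 1$ and $\varepsilon_{1m}\varepsilon_{2,m-1} \cdots \varepsilon_{m-1,2} \ne 0$. Put $\varepsilon_{m,1} = \varepsilon_1$, ..., $\varepsilon_{1m} = \varepsilon_m$. Every $w \in N_m$ can be written in a unique way in the form $w = zey$, with $y \in G_m$, $z \in G_m$, $e \in F_m$ (Lemma 1), whence a bijection $\Phi$ of $N_m$ onto $G_m \times G_m \times F_m$. Then:


LEMMA 3. The bijection $\Phi$ transforms the measure $dw$ on $N_m$ into the measure $dy\, dz\, d\varepsilon$, where

\[ d\varepsilon = \varepsilon_2^2\varepsilon_3^4 \cdots \varepsilon_m^{2(m-1)}\, d\varepsilon_1\, d\varepsilon_2 \cdots d\varepsilon_m. \]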
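Proof. Put $w = (\omega_{jk})$, $y = (\eta_{jk})$, $e = (\varepsilon_{jk})$, $z = (\zeta_{jk})$. If $w = zey$, we have

\[ \omega_{jk} = \sum_{r=1}^{m} \sum_{s=1}^{m} \zeta_{jr}\,\varepsilon_{rs}\,\eta_{sk} = \sum_{r=1}^{m} \zeta_{jr}\,\varepsilon_{r,m-r+1}\,\eta_{m-r+1,k} = \sum_{r \le \inf(j,\, m-k+1)} \zeta_{jr}\,\varepsilon_{m-r+1}\,\eta_{m-r+1,k}. \]

Hence

\[ d\omega_{jk} = \sum_{1 \le r \le \inf(j,\, m-k+1)} \left( \varepsilon_{m-r+1}\,\eta_{m-r+1,k}\, d\zeta_{jr} + \zeta_{jr}\,\eta_{m-r+1,k}\, d\varepsilon_{m-r+1} + \zeta_{jr}\,\varepsilon_{m-r+1}\, d\eta_{m-r+1,k} \right). \]

In particular $d\omega_{1m} = d\varepsilon_m$. Suppose it has been proved that

\[ (12) \qquad \prod_{j \le p,\ k \ge m-p+1} d\omega_{jk} = \pm\, \varepsilon_{m-p+2}^2\, \varepsilon_{m-p+3}^4 \cdots \varepsilon_m^{2(p-1)} \Big( \prod_{1 \le k < j \le p} d\zeta_{jk} \Big) \Big( \prod_{m-p+1 \le j \le m} d\varepsilon_j \Big) \Big( \prod_{m-p+1 \le k < j \le m} d\eta_{jk} \Big). \]

We then have

\[ \Big( \prod_{j \le p,\ k \ge m-p+1} d\omega_{jk} \Big)\, d\omega_{1,m-p} = \pm\, \varepsilon_{m-p+2}^2 \cdots \varepsilon_m^{2(p-1)} \Big( \prod_{1 \le k < j \le p} d\zeta_{jk} \Big) \Big( \prod_{m-p+1 \le j \le m} d\varepsilon_j \Big) \Big( \prod_{m-p+1 \le k < j \le m} d\eta_{jk} \Big) \left( \eta_{m,m-p}\, d\varepsilon_m + \varepsilon_m\, d\eta_{m,m-p} \right) \]
\[ = \pm\, \varepsilon_{m-p+2}^2\, \varepsilon_{m-p+3}^4 \cdots \varepsilon_m^{2(p-1)} \Big( \prod_{1 \le k < j \le p} d\zeta_{jk} \Big) \Big( \prod_{m-p+1 \le j \le m} d\varepsilon_j \Big) \Big( \prod_{m-p+1 \le k < j \le m} d\eta_{jk} \Big)\, \varepsilon_m\, d\eta_{m,m-p}. \]

Suppose it has been proved, for an integer $q$ such that $1 < q \le p$, that

\[ (13) \qquad \Big( \prod_{j \le p,\ k \ge m-p+1} d\omega_{jk} \Big)\, d\omega_{1,m-p} \cdots d\omega_{q-1,m-p} \]
\[ = \pm\, \varepsilon_{m-p+2}^2\, \varepsilon_{m-p+3}^4 \cdots \varepsilon_m^{2(p-1)} \Big( \prod_{1 \le k < j \le p} d\zeta_{jk} \Big) \Big( \prod_{m-p+1 \le j \le m} d\varepsilon_j \Big) \Big( \prod_{m-p+1 \le k < j \le m} d\eta_{jk} \Big)\, \varepsilon_m\varepsilon_{m-1} \cdots \varepsilon_{m-q+2}\ d\eta_{m,m-p}\, d\eta_{m-1,m-p} \cdots d\eta_{m-q+2,m-p}. \]

Then

\[ (14) \qquad \Big( \prod_{j \le p,\ k \ge m-p+1} d\omega_{jk} \Big)\, d\omega_{1,m-p} \cdots d\omega_{q,m-p} \]
\[ = \pm\, \varepsilon_{m-p+2}^2\, \varepsilon_{m-p+3}^4 \cdots \varepsilon_m^{2(p-1)} \Big( \prod_{1 \le k < j \le p} d\zeta_{jk} \Big) \Big( \prod_{m-p+1 \le j \le m} d\varepsilon_j \Big) \Big( \prod_{m-p+1 \le k < j \le m} d\eta_{jk} \Big)\, \varepsilon_m\varepsilon_{m-1} \cdots \varepsilon_{m-q+2}\ d\eta_{m,m-p} \cdots d\eta_{m-q+2,m-p}\ \left( \varepsilon_{m-q+1}\, d\eta_{m-q+1,m-p} \right). \]

The passage from (13) to (14) proves, by induction on $q$, that

\[ \Big( \prod_{j \le p,\ k \ge m-p+1} d\omega_{jk} \Big)\, d\omega_{1,m-p} \cdots d\omega_{p,m-p} = \pm\, \varepsilon_{m-p+2}^2 \cdots \varepsilon_m^{2(p-1)}\ \varepsilon_{m-p+1} \cdots \varepsilon_m \Big( \prod_{1 \le k < j \le p} d\zeta_{jk} \Big) \Big( \prod_{m-p+1 \le j \le m} d\varepsilon_j \Big) \Big( \prod_{m-p \le k < j \le m} d\eta_{jk} \Big). \]

From there one passes in the same way, by induction, to the formula

\[ \Big( \prod_{j \le p,\ k \ge m-p+1} d\omega_{jk} \Big)\, d\omega_{1,m-p} \cdots d\omega_{p,m-p}\ d\omega_{p+1,m} \cdots d\omega_{p+1,m-p+1} = \pm\, \varepsilon_{m-p+1}^2\, \varepsilon_{m-p+2}^4 \cdots \varepsilon_m^{2p} \Big( \prod_{1 \le k < j \le p+1} d\zeta_{jk} \Big) \Big( \prod_{m-p+1 \le j \le m} d\varepsilon_j \Big) \Big( \prod_{m-p \le k < j \le m} d\eta_{jk} \Big). \]

Finally, multiplying by $d\omega_{p+1,m-p}$, we obtain

\[ (15) \qquad \prod_{j \le p+1,\ k \ge m-p} d\omega_{jk} = \pm\, \varepsilon_{m-p+1}^2\, \varepsilon_{m-p+2}^4 \cdots \varepsilon_m^{2p} \Big( \prod_{1 \le k < j \le p+1} d\zeta_{jk} \Big) \Big( \prod_{m-p \le j \le m} d\varepsilon_j \Big) \Big( \prod_{m-p \le k < j \le m} d\eta_{jk} \Big). \]

The passage from (12) to (15) proves Lemma 3, by induction on $p$.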
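For orientation, here is the smallest nontrivial case of Lemma 3, $m = 2$, computed directly. It is an added illustration rather than part of the original proof; $\zeta = \zeta_{21}$, $\eta = \eta_{21}$, $\varepsilon_1 = \varepsilon_{21}$, $\varepsilon_2 = \varepsilon_{12}$.

```latex
% Illustration only (m = 2): Jacobian of (eta, zeta, eps_1, eps_2) -> w = z e y.
\begin{align*}
\omega_{12} &= \varepsilon_2, &
\omega_{11} &= \varepsilon_2\,\eta, \\
\omega_{22} &= \zeta\,\varepsilon_2, &
\omega_{21} &= \varepsilon_1 + \zeta\,\varepsilon_2\,\eta, \\
d\omega_{12} &= d\varepsilon_2, &
d\omega_{11} &= \eta\, d\varepsilon_2 + \varepsilon_2\, d\eta, \\
d\omega_{22} &= \varepsilon_2\, d\zeta + \zeta\, d\varepsilon_2, &
d\omega_{21} &= d\varepsilon_1 + (\text{terms in } d\zeta,\ d\varepsilon_2,\ d\eta),
\end{align*}
\begin{equation*}
d\omega_{11}\, d\omega_{12}\, d\omega_{21}\, d\omega_{22}
  = \pm\, \varepsilon_2^{2}\ d\eta\ d\zeta\ d\varepsilon_1\, d\varepsilon_2 .
\end{equation*}
```

This agrees with $d\varepsilon = \varepsilon_2^2\, d\varepsilon_1\, d\varepsilon_2$ in the statement: one factor $\varepsilon_2$ comes from $d\omega_{11}$ and one from $d\omega_{22}$.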
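This being established, let us return to formula (11). It implies

\[ \int_{F_m} \mathrm{tr}(U_e(F)^*U_e(F))\, d\varepsilon = \int_{G_m}\!\int_{G_m}\!\int_{G_m}\!\int_{G_m}\!\int_{F_m} |F_2(y, z, y'ez')|^2\, dy\, dz\, dy'\, dz'\, d\varepsilon \]
\[ = \int_{G_m}\!\int_{G_m}\!\int_{N_m} |F_2(y, z, w)|^2\, dy\, dz\, dw = \int_{G_m}\!\int_{G_m}\!\int_{M_m} |F_2(y, z, w)|^2\, dy\, dz\, dw. \]

Hence, using the Plancherel formula on the abelian group $M_m$,

\[ \int_{E_m} \mathrm{tr}(U_e(F)^*U_e(F))\, d\varepsilon = (2\pi)^{m^2} \int_{G_m}\!\int_{G_m}\!\int_{M_m} |F_1(y, z, w)|^2\, dy\, dz\, dw = (2\pi)^{m^2} \int_{G_{2m}} |F(x)|^2\, dx. \]

This completes the proof of Theorem 2.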


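3. The case where n is even. Global characters of the representations $U_e$.


LEMMA 4. Let $P_1(x_1, \ldots, x_r)$, ..., $P_s(x_1, \ldots, x_r)$ be polynomials with real coefficients. Let $\varphi$ be the map of $\mathbf{R}^r$ into $\mathbf{R}^s$ defined by the equalities $y_1 = P_1(x_1, \ldots, x_r)$, ..., $y_s = P_s(x_1, \ldots, x_r)$. Suppose that there exist constants $A > 0$, $\delta > 0$ such that $\|\varphi(x)\| \ge A\|x\|^{\delta}$ for $x \in \mathbf{R}^r$, $\|x\| \ge 1$ (we put $\|x\| = |x_1| + \cdots + |x_r|$, $\|y\| = |y_1| + \cdots + |y_s|$ for $x = (x_1, \ldots, x_r) \in \mathbf{R}^r$, $y = (y_1, \ldots, y_s) \in \mathbf{R}^s$). Then, if $f \in \mathcal{S}(\mathbf{R}^s)$ (with the notations of (8)), we have $f \circ \varphi \in \mathcal{S}(\mathbf{R}^r)$, and the map $f \to f \circ \varphi$ of $\mathcal{S}(\mathbf{R}^s)$ into $\mathcal{S}(\mathbf{R}^r)$ is continuous.


Proof. Let $f$ be an arbitrary numerical function on $\mathbf{R}^s$. We have, for every $\alpha > 0$,

\[ \sup_{x \in \mathbf{R}^r,\ \|x\| \ge 1} |(f \circ \varphi)(x)|\, \|x\|^{\alpha} \le A^{-\alpha/\delta} \sup_{x \in \mathbf{R}^r,\ \|x\| \ge 1} |f(\varphi(x))|\, \|\varphi(x)\|^{\alpha/\delta} \le A^{-\alpha/\delta} \sup_{y \in \mathbf{R}^s} |f(y)|\, \|y\|^{\alpha/\delta}. \]

Suppose now that $f \in \mathcal{S}(\mathbf{R}^s)$. Then $f \circ \varphi$ is infinitely differentiable on $\mathbf{R}^r$. Every partial derivative $D(f \circ \varphi)$ of $f \circ \varphi$ is a sum of terms of the form $Q \cdot ((D'f) \circ \varphi)$, where $Q$ is a polynomial and $D'$ is a partial derivation (as is seen at once by induction on the order of $D$). There exist constants $B > 0$, $\beta > 0$ such that $|Q(x)| \le B\|x\|^{\beta}$ for $\|x\| \ge 1$. Then

\[ \sup_{x \in \mathbf{R}^r,\ \|x\| \ge 1} |Q(x)((D'f) \circ \varphi)(x)|\, \|x\|^{\alpha} \le B \sup_{x \in \mathbf{R}^r,\ \|x\| \ge 1} |((D'f) \circ \varphi)(x)|\, \|x\|^{\alpha+\beta} \le BA^{-(\alpha+\beta)/\delta} \sup_{y \in \mathbf{R}^s} |(D'f)(y)|\, \|y\|^{(\alpha+\beta)/\delta}. \]

On the other hand,

\[ \sup_{x \in \mathbf{R}^r,\ \|x\| \le 1} |Q(x)((D'f) \circ \varphi)(x)|\, \|x\|^{\alpha} \le \Big( \sup_{x \in \mathbf{R}^r,\ \|x\| \le 1} |Q(x)| \Big) \sup_{y \in \mathbf{R}^s} |(D'f)(y)|. \]

This proves the lemma, using the definition of the topology of $\mathcal{S}(\mathbf{R}^r)$ and $\mathcal{S}(\mathbf{R}^s)$ by semi-norms.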
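Two small examples, added here for illustration and not taken from the original, show what the growth hypothesis of Lemma 4 is doing. The particular maps $\varphi$ and the Gaussian $f$ are choices made for this sketch.

```latex
% Illustration only: one map satisfying the hypothesis of Lemma 4, one violating it.
\begin{align*}
&\text{(a)}\quad r = s = 1,\ \ \varphi(x) = x^3:\qquad
   |\varphi(x)| = |x|^3 \quad (A = 1,\ \delta = 3), \\
&\phantom{\text{(a)}\quad} f(y) = e^{-y^2}\ \Longrightarrow\
   (f\circ\varphi)(x) = e^{-x^6} \in \mathcal{S}(\mathbf{R}). \\[4pt]
&\text{(b)}\quad r = 2,\ s = 1,\ \ \varphi(x_1, x_2) = x_1x_2:\qquad
   \varphi(x_1, 0) = 0 \ \text{for all } x_1, \\
&\phantom{\text{(b)}\quad} \text{so no bound } |\varphi(x)| \ge A\|x\|^{\delta}
   \text{ holds, and } (f\circ\varphi)(x_1, 0) = 1 \text{ does not decay.}
\end{align*}
```

In case (b) the composite $e^{-x_1^2x_2^2}$ is smooth but not rapidly decreasing, which shows that a hypothesis of this kind cannot simply be dropped.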

Now let $x = (\xi_{jk}) \to F(x) = F((\xi_{jk}))$ be a function on $G_n$. (Recall that $\xi_{jk} = 0$ for $j < k$ and that $\xi_{jj} = 1$ if $x \in G_n$.) To say that $F$ is infinitely differentiable on $G_n$ amounts to saying that $F$ is an infinitely differentiable function of the $\xi_{jk}$ ($j > k$). If moreover $F$ is an infinitely differentiable, rapidly decreasing function of the $\xi_{jk}$ ($j > k$), we shall say that $F$ is infinitely differentiable and rapidly decreasing on $G_n$. It amounts to the same to say that $F$, transported to $\mathfrak{g}_n$ by means of the exponential map (which is an isomorphism of the differentiable manifold $\mathfrak{g}_n$ onto the differentiable manifold $G_n$), becomes an infinitely differentiable, rapidly decreasing function in the usual sense on the vector space $\mathfrak{g}_n$.

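THEOREM 3. Let $e = (\varepsilon_{jk}) \in E_m$; put $\varepsilon_1 = \varepsilon_{m,1}$, ..., $\varepsilon_m = \varepsilon_{1,m}$; suppose $\varepsilon_2\varepsilon_3 \cdots \varepsilon_m \ne 0$.

(i) Let

\[ x = \begin{pmatrix} y & 0 \\ w & z \end{pmatrix} \to F(x) = F_1(y, z, w) \]

be an infinitely differentiable, rapidly decreasing function on $G_{2m}$. Then the operator $U_e(F)$ is a trace-class operator.
(ii) There exists a tempered distribution $T_e$ on $M_m$ such that

\[ \mathrm{tr}(U_e(F)) = \int F_1(1, 1, w)\, dT_e(w). \]

(iii) On the set $N_m$ of the $w \in M_m$ such that $\Delta_2(w)\Delta_3(w) \cdots \Delta_m(w) \ne 0$, $T_e$ coincides with the function

\[ w \to (2\pi)^{\frac{1}{2}m(m-1)}\, \left| \varepsilon_2^{-1}\varepsilon_3^{-2} \cdots \varepsilon_m^{-m+1} \right|\ \frac{\exp i\left[ \varepsilon_1\Delta_m(w) - \varepsilon_2\dfrac{\Delta_{m-1}(w)}{\Delta_m(w)} + \varepsilon_3\dfrac{\Delta_{m-2}(w)}{\Delta_{m-1}(w)} - \cdots + (-1)^{m+1}\varepsilon_m\dfrac{\Delta_1(w)}{\Delta_2(w)} \right]}{|\Delta_2(w)\Delta_3(w) \cdots \Delta_m(w)|}. \]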


Proof. As in the proof of Theorem 2, let us introduce the function $(y, z, w) \to F_2(y, z, w)$ on $G_m \times G_m \times M_m$, the Fourier transform of the function $(y, z, w) \to F_1(y, z, w)$ with respect to the variable $w$. Formula (10) proves that $U_e(F)$ is defined by a kernel $(y, z, y', z') \to K(y, z, y', z')$ on $(G_m \times G_m) \times (G_m \times G_m)$, this kernel being given by the formula

\[ K(y, z, y', z') = F_2(y'^{-1}y,\ z'^{-1}z,\ y^{-1}ez'). \]

We shall show that there exist constants $A > 0$ and $\delta > 0$ such that

\[ (16) \qquad \|y'^{-1}y\| + \|z'^{-1}z\| + \|y^{-1}ez'\| \ge A(\|y\| + \|z\| + \|y'\| + \|z'\|)^{\delta} \]

(where we put $\|y\| = \sum_{j>k} |\eta_{jk}|$ for an element $y = (\eta_{jk})$ of $G_m$, and $\|w\| = \sum |\omega_{jk}|$ for an element $w = (\omega_{jk})$ of $M_m$). It is immediate that there exist constants $A_1 > 0$, $A_2 > 0$, $\delta_1 > 0$, $\delta_2 > 0$ such that

\[ \|y^{-1}\| \le A_1\|y\|^{\delta_1}, \qquad \|yz\| \le A_2(\|y\| + \|z\|)^{\delta_2} \]

for all $y \in G_m$, $z \in G_m$ with $\|y\| \ge 1$, $\|z\| \ge 1$. On the other hand, the proof of Lemma 1 shows that there exist constants $A_3 > 0$, $\delta_3 > 0$ such that

\[ \|y\| \le A_3\|y^{-1}ez'\|^{\delta_3} \]

for all $y \in G_m$, $z' \in G_m$. (Recall that $e \in E_m$ is fixed and that $\varepsilon_2 \cdots \varepsilon_m \ne 0$; one also takes into account the fact that $\|y^{-1}ez'\| \ge |\varepsilon_m| > 0$.) Taking transposes, we deduce the existence of constants $A_4 > 0$, $\delta_4 > 0$ such that

\[ \|z'\| \le A_4\|y^{-1}ez'\|^{\delta_4} \]

for all $y \in G_m$, $z' \in G_m$. Hence the existence of constants $A_5 > 0$, $\delta_5 > 0$ such that

\[ \|y'\| = \|y(y'^{-1}y)^{-1}\| \le A_5(\|y'^{-1}y\| + \|y^{-1}ez'\|)^{\delta_5}, \]
\[ \|z\| = \|z'(z'^{-1}z)\| \le A_5(\|z'^{-1}z\| + \|y^{-1}ez'\|)^{\delta_5} \]

for all $y, y', z, z' \in G_m$. We finally deduce the existence of constants $A_6 > 0$, $\delta_6 > 0$ with

\[ \|y\| + \|z\| + \|y'\| + \|z'\| \le A_6(\|y'^{-1}y\| + \|z'^{-1}z\| + \|y^{-1}ez'\|)^{\delta_6} \]

for all $y, y', z, z' \in G_m$. Hence (16).
Since $F_2$ is an infinitely differentiable, rapidly decreasing function on $G_m \times G_m \times M_m$, Lemma 4 then proves that $K$ is an infinitely differentiable, rapidly decreasing function on $(G_m \times G_m) \times (G_m \times G_m)$. The operator $U_e(F)$, defined by the kernel $K$, is therefore a Hilbert-Schmidt operator. Put $X = G_m \times G_m$, so that $K \in \mathcal{S}(X \times X)$ with the notations of (8). The canonical map of $\mathcal{S}(X)$ into $L^2(X)$ (the space $L^2$ being taken for the canonical Haar measure of $G_m \times G_m$) defines a canonical map of $\mathcal{S}(X) \mathbin{\hat\otimes} \mathcal{S}(X)$ into $L^2(X) \mathbin{\hat\otimes} L^2(X)$ (cf. (6) for the notations used here concerning topological tensor products). Now, since $\mathcal{S}(X)$ is a nuclear space, we have $\mathcal{S}(X) \mathbin{\hat\otimes} \mathcal{S}(X) = \mathcal{S}(X \times X)$, whence a canonical map $\Lambda : \mathcal{S}(X \times X) \to L^2(X) \mathbin{\hat\otimes} L^2(X)$; on the other hand $L^2(X) \mathbin{\hat\otimes} L^2(X)$ is identified with the set of trace-class operators on $L^2(X)$, and it is immediate that $\Lambda(K)$ is none other than the operator defined by $K$, that is, $U_e(F)$. Hence (i). Moreover, if $K_1 \in \mathcal{S}(X \times X)$, we have

\[ \mathrm{tr}(\Lambda(K_1)) = \int K_1(\xi, \xi)\, d\xi; \]

this is evident if $K_1$ is an elementary kernel of the form $(\xi, \xi') \to \varphi(\xi)\varphi'(\xi')$, where $\varphi \in \mathcal{S}(X)$, $\varphi' \in \mathcal{S}(X)$; and the general case follows from this by linearity and continuity. Hence

\[ (17) \qquad \mathrm{tr}(U_e(F)) = \iint K(y, z, y, z)\, dy\, dz = \iint F_2(1, 1, y^{-1}ez)\, dy\, dz. \]

Now the map $(y, z) \to y^{-1}ez$ of $G_m \times G_m$ into $M_m$ transforms the measure $dy\, dz$ into a tempered distribution on $M_m$ (because of Lemma 4 and of the inequalities

\[ \|y\| \le A_3\|y^{-1}ez\|^{\delta_3}, \qquad \|z\| \le A_4\|y^{-1}ez\|^{\delta_4}); \]

let $T_e$ be the Fourier transform of this distribution, which is also a tempered distribution; we have

\[ \mathrm{tr}(U_e(F)) = \int F_1(1, 1, w)\, dT_e(w). \]

Hence (ii).
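We have

\[ F_2(1, 1, yez) = \int F_1(1, 1, w)\, \exp i\,\mathrm{tr}(wyez)\, dw = \int F_1(1, 1, wy^{-1})\, \exp i\,\mathrm{tr}(wez)\, dw. \]

Put $z = 1 + z^0$, so that $z^0$ ranges over the vector space $\mathfrak{g}_m$. We then have

\[ F_2(1, 1, yez) = \int F_1(1, 1, wy^{-1})\, \exp i\,\mathrm{tr}(we) \cdot \exp i\,\mathrm{tr}(wez^0)\, dw = \int \Phi(w)\, \exp i\,\mathrm{tr}(wez^0)\, dw, \]

putting $\Phi(w) = F_1(1, 1, wy^{-1})\, \exp i\,\mathrm{tr}(we)$. The function $w \to \Phi(w)$ is infinitely differentiable and rapidly decreasing on $M_m$. Let $w = (\omega_{jk})$, $z = (\zeta_{jk})$. We have

\[ \mathrm{tr}(wez^0) = \sum_{j+k<m+1} \omega_{jk}\, \varepsilon_{k,m-k+1}\, \zeta_{m-k+1,j} = \sum_{j+k<m+1} \omega_{jk}\, \varepsilon_{m-k+1}\, \zeta_{m-k+1,j}. \]

Hence

\[ F_2(1, 1, yez) = \int \cdots \int \Phi\big( (\omega_{jk})_{1 \le j \le m,\ 1 \le k \le m} \big) \Big( \exp i \sum_{j+k<m+1} \omega_{jk}\, \varepsilon_{m-k+1}\, \zeta_{m-k+1,j} \Big)\, d\omega_{11} \cdots d\omega_{mm} \]
\[ = \int \cdots \int \Psi\big( (\omega_{jk})_{j+k<m+1} \big) \Big( \exp i \sum_{j+k<m+1} \omega_{jk}\, \varepsilon_{m-k+1}\, \zeta_{m-k+1,j} \Big) \prod_{j+k<m+1} d\omega_{jk} \]

with

\[ \Psi\big( (\omega_{jk})_{j+k<m+1} \big) = \int \cdots \int \Phi\big( (\omega_{jk})_{1 \le j \le m,\ 1 \le k \le m} \big) \prod_{j+k \ge m+1} d\omega_{jk}. \]

The function $\Psi$ is infinitely differentiable and rapidly decreasing, and we have

\[ \int F_2(1, 1, yez)\, dz = \int \cdots \int \Big( \prod_{j+k<m+1} d\zeta_{m-k+1,j} \Big) \int \cdots \int \Psi\big( (\omega_{jk})_{j+k<m+1} \big) \Big( \exp i \sum_{j+k<m+1} \omega_{jk}\, \varepsilon_{m-k+1}\, \zeta_{m-k+1,j} \Big) \prod_{j+k<m+1} d\omega_{jk} \]
\[ = \left| \varepsilon_2^{-1}\varepsilon_3^{-2} \cdots \varepsilon_m^{-m+1} \right| \int \cdots \int \Big( \prod_{j+k<m+1} d\zeta_{m-k+1,j} \Big) \int \cdots \int \Psi\big( (\omega_{jk})_{j+k<m+1} \big) \Big( \exp i \sum_{j+k<m+1} \omega_{jk}\, \zeta_{m-k+1,j} \Big) \prod_{j+k<m+1} d\omega_{jk} \]
\[ = (2\pi)^{\frac{1}{2}m(m-1)} \left| \varepsilon_2^{-1}\varepsilon_3^{-2} \cdots \varepsilon_m^{-m+1} \right| \Psi(0) = (2\pi)^{\frac{1}{2}m(m-1)} \left| \varepsilon_2^{-1}\varepsilon_3^{-2} \cdots \varepsilon_m^{-m+1} \right| \int \Phi(t)\, dt, \]

where $t$ ranges over the set $\mathfrak{T}_m$ of the matrices $(\tau_{jk}) \in M_m$ such that $\tau_{jk} = 0$ for $j + k < m + 1$, and where $dt$ denotes the measure defined by the differential form

\[ \prod_{j+k \ge m+1} d\tau_{jk}. \]

Thus

\[ \int F_2(1, 1, yez)\, dz = (2\pi)^{\frac{1}{2}m(m-1)} \left| \varepsilon_2^{-1}\varepsilon_3^{-2} \cdots \varepsilon_m^{-m+1} \right| \int F_1(1, 1, ty^{-1})\, \exp i\,\mathrm{tr}(te)\, dt. \]

Every matrix $t = (\tau_{jk}) \in \mathfrak{T}_m$ such that $\tau_{1m}\tau_{2,m-1} \cdots \tau_{m,1} \ne 0$ can be written in a unique way in the form $ze'$, where $e' = (\varepsilon'_{jk}) \in E_m$ and where $z = (\zeta_{jk}) \in G_m$. Putting $\varepsilon'_{m-j+1,j} = \varepsilon'_j$, we have $\tau_{jk} = \zeta_{j,m-k+1}\,\varepsilon'_k$, whence $\varepsilon'_k = \tau_{m-k+1,k}$. Hence

\[ \mathrm{tr}(te) = \sum_{j=1}^{m} \tau_{j,m-j+1}\, \varepsilon_j = \sum_{j=1}^{m} \varepsilon'_{m-j+1}\, \varepsilon_j, \]
\[ dt = \left| \varepsilon'_2\, \varepsilon_3'^{\,2} \cdots \varepsilon_m'^{\,m-1} \right| d\varepsilon'_1 \cdots d\varepsilon'_m\, dz \]

or, taking account of the notation of Lemma 3,

\[ dt = \left| \varepsilon_2'^{\,-1} \varepsilon_3'^{\,-2} \cdots \varepsilon_m'^{\,-m+1} \right| d\varepsilon'\, dz. \]

Consequently

\[ \int F_2(1, 1, yez)\, dz = (2\pi)^{\frac{1}{2}m(m-1)} \left| \varepsilon_2^{-1}\varepsilon_3^{-2} \cdots \varepsilon_m^{-m+1} \right| \iint F_1(1, 1, ze'y^{-1}) \Big( \exp i \sum_{j=1}^{m} \varepsilon'_{m-j+1}\, \varepsilon_j \Big) \left| \varepsilon_2'^{\,-1} \varepsilon_3'^{\,-2} \cdots \varepsilon_m'^{\,-m+1} \right| d\varepsilon'\, dz. \]

Let $F_3$ be the function $w \to F_1(1, 1, w)$ on $M_m$. Suppose now that the support of $F_3$ is compact and contained in $N_m$. Then the function $w \to F_3(w)\, |\Delta_2(w)^{-1}\Delta_3(w)^{-1} \cdots \Delta_m(w)^{-1}|$ is integrable for $dw$. Hence the function

\[ (z, e', y) \to F_3(ze'y)\, \left| \varepsilon_2'^{\,-1} \varepsilon_3'^{\,-2} \cdots \varepsilon_m'^{\,-m+1} \right| \]

is integrable for the measure $dy\, d\varepsilon'\, dz$. Hence

\[ \iint F_2(1, 1, yez)\, dy\, dz = (2\pi)^{\frac{1}{2}m(m-1)} \left| \varepsilon_2^{-1}\varepsilon_3^{-2} \cdots \varepsilon_m^{-m+1} \right| \iiint F_1(1, 1, ze'y^{-1}) \Big( \exp i \sum_{j=1}^{m} \varepsilon'_{m-j+1}\, \varepsilon_j \Big) \left| \varepsilon_2'^{\,-1} \varepsilon_3'^{\,-2} \cdots \varepsilon_m'^{\,-m+1} \right| dy\, d\varepsilon'\, dz \]
\[ = (2\pi)^{\frac{1}{2}m(m-1)} \left| \varepsilon_2^{-1}\varepsilon_3^{-2} \cdots \varepsilon_m^{-m+1} \right| \int F_1(1, 1, w)\ \frac{\exp i\left[ \varepsilon_1\Delta_m(w) - \varepsilon_2\dfrac{\Delta_{m-1}(w)}{\Delta_m(w)} + \cdots + (-1)^{m+1}\varepsilon_m\dfrac{\Delta_1(w)}{\Delta_2(w)} \right]}{|\Delta_2(w)\Delta_3(w) \cdots \Delta_m(w)|}\, dw, \]

which proves (iii).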
Remark. Except in the trivial case where $m = 1$, the function $w \to \Delta_2(w)^{-1} \cdots \Delta_m(w)^{-1}$ is not locally integrable for $dw$, so that $T_e$ is not a measure.
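As an added illustration (not in the original), the two smallest cases of the character formula can be written out. This assumes the reading of Theorem 3 (iii) in which the exponent is $\sum_j (-1)^{j+1}\varepsilon_j\,\Delta_{m-j+1}(w)/\Delta_{m-j+2}(w)$ and the constant is $(2\pi)^{\frac12 m(m-1)}\,|\varepsilon_2^{-1}\cdots\varepsilon_m^{-m+1}|$.

```latex
% Illustration only: Theorem 3 (iii) for m = 1 and m = 2, under the stated reading.
\begin{align*}
m = 1:&\qquad T_e(w) = \exp\bigl(i\,\varepsilon_1\,\omega_{11}\bigr), \\[4pt]
m = 2:&\qquad T_e(w) = \frac{2\pi}{|\varepsilon_2|}\;
   \frac{\exp i\Bigl[\varepsilon_1\,\omega_{12}
        - \varepsilon_2\,\dfrac{\omega_{11}\omega_{22} - \omega_{12}\omega_{21}}{\omega_{12}}\Bigr]}
        {|\omega_{12}|}
   \qquad (\omega_{12} \neq 0).
\end{align*}
```

For $m = 1$ the group $G_2$ is abelian and $U_e$ is just the character $\xi_e$, so $T_e$ is a bounded function. For $m = 2$ the factor $1/|\omega_{12}|$ is not integrable near the hyperplane $\omega_{12} = 0$, which is the phenomenon the Remark describes.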


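4. The case where n is odd. Centre of $U(\mathfrak{g}_n)$. We now put $n = 2m + 1$. Every element $x$ of $G_{2m+1}$ can be written in the form

\[ x = \begin{pmatrix} y & 0 & 0 \\ u & 1 & 0 \\ w & v & z \end{pmatrix} \]

where $y \in G_m$, $z \in G_m$, $w \in M_m$, and where $u$ (resp. $v$) is a matrix with 1 row and $m$ columns (resp. 1 column and $m$ rows). If

\[ x' = \begin{pmatrix} y' & 0 & 0 \\ u' & 1 & 0 \\ w' & v' & z' \end{pmatrix}, \]

we have

\[ xx' = \begin{pmatrix} yy' & 0 & 0 \\ uy' + u' & 1 & 0 \\ wy' + vu' + zw' & v + zv' & zz' \end{pmatrix}. \]

Hence easily

\[ x^{-1} = \begin{pmatrix} y^{-1} & 0 & 0 \\ -uy^{-1} & 1 & 0 \\ z^{-1}(vu - w)y^{-1} & -z^{-1}v & z^{-1} \end{pmatrix} \]

and

\[ (18) \qquad xx'x^{-1} = \begin{pmatrix} yy'y^{-1} & 0 & 0 \\ (uy' + u' - u)y^{-1} & 1 & 0 \\ t & v + zv' - zz'z^{-1}v & zz'z^{-1} \end{pmatrix} \]

with $t = (wy' + vu' + zw' - vu - zv'u + zz'z^{-1}vu - zz'z^{-1}w)y^{-1}$. We see that the set $A_{2m+1}$ of the $x \in G_{2m+1}$ such that $y = z = 1$, $u = v = 0$ is an abelian normal subgroup of $G_{2m+1}$. The abelian ideal $\mathfrak{a}_{2m+1}$ of $\mathfrak{g}_{2m+1}$ corresponding to $A_{2m+1}$ is the set of matrices of the form

\[ \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ w & 0 & 0 \end{pmatrix}. \]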

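THEOREM 4. The centre of $U(\mathfrak{g}_{2m+1})$ is generated by the algebraically independent elements

\[ e_{2m+1,1}, \qquad \begin{vmatrix} e_{2m,1} & e_{2m,2} \\ e_{2m+1,1} & e_{2m+1,2} \end{vmatrix}, \qquad \ldots, \qquad \begin{vmatrix} e_{m+2,1} & \cdots & e_{m+2,m} \\ \cdots & \cdots & \cdots \\ e_{2m+1,1} & \cdots & e_{2m+1,m} \end{vmatrix}. \]


Proof. As in the proof of Theorem 1, one first proves that an element of $S(\mathfrak{g}_{2m+1})$ invariant under the adjoint representation lies in $S(\mathfrak{a}_{2m+1})$. Let $\rho$ be the adjoint representation of $G_{2m+1}$ in $\mathfrak{g}_{2m+1}$. Let us identify $\mathfrak{a}_{2m+1}$ with $M_m$ by the map

\[ \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ w & 0 & 0 \end{pmatrix} \to w. \]

Then, if

\[ x = \begin{pmatrix} y & 0 & 0 \\ u & 1 & 0 \\ w' & v & z \end{pmatrix} \in G_{2m+1}, \]

we have

\[ \rho(x).w = \rho(x).\begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ w & 0 & 0 \end{pmatrix} = \begin{pmatrix} y & 0 & 0 \\ u & 1 & 0 \\ w' & v & z \end{pmatrix} \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ w & 0 & 0 \end{pmatrix} \begin{pmatrix} y & 0 & 0 \\ u & 1 & 0 \\ w' & v & z \end{pmatrix}^{-1} = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ zwy^{-1} & 0 & 0 \end{pmatrix} = zwy^{-1}. \]

The reasoning is then completed exactly as for Theorem 1.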













5. The case where n is odd. Plancherel formula. We denote by $A'_{2m+1}$
the set of $x \in G_{2m+1}$ such that $y = z = 1$, $v = 0$. Formula (18) shows
that $A'_{2m+1}$ is an abelian normal subgroup of $G_{2m+1}$. Let $\mathfrak{a}'_{2m+1}$ be the
Lie algebra of $A'_{2m+1}$, that is, the set of matrices

\[
\begin{pmatrix} 0 & 0 & 0 \\ u & 0 & 0 \\ w & 0 & 0 \end{pmatrix}.
\]

The exponential map of $\mathfrak{a}'_{2m+1}$ onto $A'_{2m+1}$ is an isomorphism. Hence,
for $e \in E_m$, the map

\[
\begin{pmatrix} 1 & 0 & 0 \\ u & 1 & 0 \\ w & 0 & 1 \end{pmatrix} \mapsto \exp i \operatorname{tr}(ew)
\]

is a character $\xi_e$ of $A'_{2m+1}$. We denote by $U_e$ the unitary representation of
$G_{2m+1}$ induced by $\xi_e$.
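Every element of $G_{2m+1}$ can be written in a unique way in the form

\[
\begin{pmatrix} 1 & 0 & 0 \\ u & 1 & 0 \\ w & 0 & 1 \end{pmatrix}
\begin{pmatrix} y & 0 & 0 \\ 0 & 1 & 0 \\ 0 & v & z \end{pmatrix}
= \begin{pmatrix} y & 0 & 0 \\ uy & 1 & 0 \\ wy & v & z \end{pmatrix}.
\]

Thus $G_{2m+1}$ is the semidirect product of $A'_{2m+1}$ and of a group canonically
isomorphic to $G_m \times G_{m+1}$. Moreover,

\[
\begin{aligned}
(19)\qquad
\begin{pmatrix} y' & 0 & 0 \\ 0 & 1 & 0 \\ 0 & v' & z' \end{pmatrix}
\begin{pmatrix} y & 0 & 0 \\ u & 1 & 0 \\ w & v & z \end{pmatrix}
&= \begin{pmatrix} y'y & 0 & 0 \\ u & 1 & 0 \\ v'u + z'w & v' + z'v & z'z \end{pmatrix} \\
&= \begin{pmatrix} 1 & 0 & 0 \\ uy^{-1}y'^{-1} & 1 & 0 \\ (v'u + z'w)y^{-1}y'^{-1} & 0 & 1 \end{pmatrix}
\begin{pmatrix} y'y & 0 & 0 \\ 0 & 1 & 0 \\ 0 & v' + z'v & z'z \end{pmatrix}.
\end{aligned}
\]

Consequently, $U_e$ acts in $L^2_{\mathbf{C}}(G_m \times G_{m+1})$, where $G_m \times G_{m+1}$ is equipped with the
canonical Haar measure; and, if $(y', z', v') \mapsto f(y', z', v')$ is an element of
$L^2_{\mathbf{C}}(G_m \times G_{m+1})$, formula (19) shows that, for

\[
x = \begin{pmatrix} y & 0 & 0 \\ u & 1 & 0 \\ w & v & z \end{pmatrix} \in G_{2m+1},
\]

we have

\[
(20)\qquad (U_e(x)f)(y', z', v') = f(y'y, z'z, v' + z'v) \exp i \operatorname{tr}\bigl(e(v'u + z'w)y^{-1}y'^{-1}\bigr).
\]

Formula (20) defines the representation $U_e$ explicitly.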
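The unique factorization of an element of $G_{2m+1}$ as a product of an element of $A'_{2m+1}$ and an element of the complementary subgroup can be made concrete. The following Python sketch assumes $m = 2$ and arbitrary integer entries; the function `factor` and the other names are illustrative only. Given $x$ with blocks $(y, u, w, v, z)$, it returns the factor in $A'_{2m+1}$ with blocks $uy^{-1}$, $wy^{-1}$ and the factor with blocks $(y, v, z)$.

```python
def matmul(a, b):
    """Product of two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def add(a, b):
    return [[p + q for p, q in zip(r, s)] for r, s in zip(a, b)]

def scale(c, a):
    return [[c * p for p in row] for row in a]

def identity(n):
    return [[int(i == j) for j in range(n)] for i in range(n)]

def zeros(rows, cols):
    return [[0] * cols for _ in range(rows)]

def inv_unipotent(a):
    """Inverse of a lower unipotent matrix 1 + N via the finite sum of (-N)^k."""
    n = len(a)
    minus_nil = scale(-1, add(a, scale(-1, identity(n))))
    result, power = identity(n), identity(n)
    for _ in range(n - 1):
        power = matmul(power, minus_nil)
        result = add(result, power)
    return result

def assemble(y, u, w, v, z):
    """Build the (2m+1) x (2m+1) group element with blocks y, u, w, v, z."""
    m = len(y)
    top = [y[i] + [0] + [0] * m for i in range(m)]
    middle = [u[0] + [1] + [0] * m]
    bottom = [w[i] + [v[i][0]] + z[i] for i in range(m)]
    return top + middle + bottom

def factor(y, u, w, v, z):
    """Return (a, h) with a in A' (blocks u y^{-1}, w y^{-1}) and h = (y, v, z)."""
    m = len(y)
    yi = inv_unipotent(y)
    a = assemble(identity(m), matmul(u, yi), matmul(w, yi),
                 zeros(m, 1), identity(m))
    h = assemble(y, zeros(1, m), zeros(m, m), v, z)
    return a, h

# Arbitrary sample data for m = 2 (an assumption made only for this check).
y, z = [[1, 0], [2, 1]], [[1, 0], [-3, 1]]
u, v, w = [[4, 5]], [[6], [7]], [[1, 2], [3, 4]]

a, h = factor(y, u, w, v, z)
factorization_ok = matmul(a, h) == assemble(y, u, w, v, z)
```

The factor `h` has the block shape of an element of $G_m \times G_{m+1}$ (the pair $y$ and the lower-right $(m+1) \times (m+1)$ corner), in line with the semidirect product statement.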


THEOREM 5. For $e = (e_{jk}) \in E_m$, set $e_1 = e_{m,1}, \ldots, e_m = e_{1,m}$.

(i) The representation $U_e$ admits the infinitesimal character $\chi_e$ defined by





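\[
\chi_e(e_{2m+1,1}) = i e_m, \qquad
\chi_e \begin{vmatrix} e_{2m,1} & e_{2m,2} \\ e_{2m+1,1} & e_{2m+1,2} \end{vmatrix} = i^2 e_{m-1} e_m, \qquad \ldots,
\]

\[
\chi_e \begin{vmatrix} e_{m+2,1} & \cdots & e_{m+2,m} \\ \vdots & & \vdots \\ e_{2m+1,1} & \cdots & e_{2m+1,m} \end{vmatrix} = i^m e_1 e_2 \cdots e_m.
\]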

(ii) If one restricts to the $e \in E_m$ such that $e_1 e_2 \cdots e_m \neq 0$, the representations
$U_e$ are irreducible and pairwise inequivalent.
(iii) If $F$ is an integrable function on $G_{2m+1}$, then

\[
\int_{G_{2m+1}} |F(x)|^2\,dx = (2\pi)^{-m^2-m} \int \cdots \int \operatorname{tr}\bigl(U_e(F)^* U_e(F)\bigr)\,|e_1 e_2^3 \cdots e_m^{2m-1}|\,de_1\,de_2 \cdots de_m.
\]


Proof. If

\[
x = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ w & 0 & 1 \end{pmatrix} \in A_{2m+1},
\]

formula (20) becomes

\[
(U_e(x)f)(y', z', v') = f(y', z', v') \exp i \operatorname{tr}(e z' w y'^{-1}).
\]

The proof of (i) then concludes exactly as for Theorem 2.
To prove (ii), we again argue as for Theorem 2. It must be shown that,
if $e_1 e_2 \cdots e_m \neq 0$, an element

\[
x = \begin{pmatrix} y & 0 & 0 \\ u & 1 & 0 \\ w & v & z \end{pmatrix} \in G_{2m+1}
\]

which leaves $\xi_e$ fixed belongs to $A'_{2m+1}$. Now, by (18), the condition that $x$ leaves
$\xi_e$ fixed amounts to the condition

\[
(21)\qquad \operatorname{tr}(ew') = \operatorname{tr}\bigl(e(vu' + zw')y^{-1}\bigr)
\]

for all $w' \in M_m$ and every matrix $u'$ with 1 row and $m$ columns. Taking
$u' = 0$, this first forces $\operatorname{tr}(ew') = \operatorname{tr}(y^{-1}ezw')$ for all $w' \in M_m$,
hence $e = y^{-1}ez$, hence (Lemma 1) $y = z = 1$. Condition (21) then reduces
to $\operatorname{tr}(evu') = 0$ for all $u'$. Writing

\[
v = \begin{pmatrix} \sigma_1 \\ \vdots \\ \sigma_m \end{pmatrix}, \qquad u' = (\sigma'_1 \ldots \sigma'_m),
\]

we have

\[
vu' = (\sigma_j \sigma'_k), \qquad evu' = (e_{j,m-j+1}\,\sigma_{m-j+1}\,\sigma'_k), \qquad
\operatorname{tr}(evu') = \sum_j e_{m-j+1}\,\sigma_{m-j+1}\,\sigma'_j,
\]

whence the condition $e_{m-j+1}\sigma_{m-j+1} = 0$ $(j = 1, \ldots, m)$; since the $e_j$ are all
$\neq 0$, we conclude that $v = 0$, hence $x \in A'_{2m+1}$.

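Let $x \mapsto F(x) = F_1(y, z, v, u, w)$ be an integrable function on $G_{2m+1}$. For
$f, f' \in L^2_{\mathbf{C}}(G_m \times G_{m+1})$, we have

\[
\begin{aligned}
(22)\qquad (U_e(F)f \mid f')
&= \int \cdots \int F_1(y, z, v, u, w)\, f(y'y, z'z, v' + z'v)\, \overline{f'(y', z', v')} \\
&\qquad \exp i \operatorname{tr}\bigl(e(v'u + z'w)y^{-1}y'^{-1}\bigr)\, dy\, dz\, dv\, du\, dw\, dy'\, dz'\, dv' \\
&= \int \cdots \int F_1(y'^{-1}y, z'^{-1}z, v, u, w)\, f(y, z, v' + z'v)\, \overline{f'(y', z', v')} \\
&\qquad \exp i \operatorname{tr}\bigl(e(v'u + z'w)y^{-1}\bigr)\, dy\, dz\, dv\, du\, dw\, dy'\, dz'\, dv' \\
&= \int \cdots \int F_1(y'^{-1}y, z'^{-1}z, z'^{-1}(v - v'), u, w)\, f(y, z, v)\, \overline{f'(y', z', v')} \\
&\qquad \exp i \operatorname{tr}\bigl(e(v'u + z'w)y^{-1}\bigr)\, dy\, dz\, dv\, du\, dw\, dy'\, dz'\, dv'.
\end{aligned}
\]

Hence

\[
\operatorname{tr}\bigl(U_e(F)^* U_e(F)\bigr) = \int \cdots \int \Bigl| \iint F_1(y'^{-1}y, z'^{-1}z, z'^{-1}(v - v'), u, w)
\exp i \operatorname{tr}\bigl(e(v'u + z'w)y^{-1}\bigr)\, du\, dw \Bigr|^2 dy\, dz\, dv\, dy'\, dz'\, dv'.
\]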


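As above, we identify the vector space $M_m$ with its dual by means of
the bilinear form $(w_1, w_2) \mapsto \operatorname{tr}(w_1 w_2)$. On the other hand, the vector space
$M_{1,m}$ of matrices $u$ with 1 row and $m$ columns has a dual, which we identify
with the vector space $M_{m,1}$ of matrices $v$ with $m$ rows and 1 column, by setting

\[
\langle u, v \rangle = \operatorname{tr}(vu).
\]

Then the function

\[
(y, z, v, u, w) \mapsto F_1(y, z, v, u, w)
\]

defined on $G_m \times G_m \times M_{m,1} \times M_{1,m} \times M_m$ has, with respect to the variables
$u$ and $w$, a Fourier transform, which we denote by $\hat{F}$, defined on
$G_m \times G_m \times M_{m,1} \times M_{m,1} \times M_m$; and we have

\[
\begin{aligned}
\operatorname{tr}\bigl(U_e(F)^* U_e(F)\bigr)
&= \int \cdots \int |\hat{F}(y'^{-1}y, z'^{-1}z, z'^{-1}(v - v'), y^{-1}ev', y^{-1}ez')|^2\, dy\, dz\, dv\, dy'\, dz'\, dv' \\
&= \int \cdots \int |\hat{F}(y, z, z'^{-1}(v - v'), y^{-1}y'^{-1}ev', y^{-1}y'^{-1}ez')|^2\, dy\, dz\, dv\, dy'\, dz'\, dv' \\
&= \int \cdots \int |\hat{F}(y, z, z'^{-1}(v - v'), y'ev', y'ez')|^2\, dy\, dz\, dv\, dy'\, dz'\, dv' \\
&= \int \cdots \int |\hat{F}(y, z, v, y'ev', y'ez')|^2\, dy\, dz\, dv\, dy'\, dz'\, dv'.
\end{aligned}
\]

For $e_1 e_2 \cdots e_m \neq 0$, we have $\det(y'e) = \det e \neq 0$. Hence

\[
\operatorname{tr}\bigl(U_e(F)^* U_e(F)\bigr) = \int \cdots \int |\hat{F}(y, z, v, v', y'ez')|^2\, |\det e|^{-1}\, dy\, dz\, dv\, dy'\, dz'\, dv'.
\]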


Then, using Lemma 3,

\[
\begin{aligned}
\int \cdots \int \operatorname{tr}\bigl(U_e(F)^* U_e(F)\bigr)\, |e_1 e_2^3 \cdots e_m^{2m-1}|\, de_1\, de_2 \cdots de_m
&= \int \cdots \int |\hat{F}(y, z, v, v', y'ez')|^2\, |e_2^2 \cdots e_m^{2m-2}|\, dy\, dz\, dv\, dy'\, dz'\, dv'\, de_1\, de_2 \cdots de_m \\
&= \int \cdots \int |\hat{F}(y, z, v, v', w)|^2\, dy\, dz\, dv\, dv'\, dw \\
&= (2\pi)^{m^2+m} \int \cdots \int |F_1(y, z, v, u, w)|^2\, dy\, dz\, dv\, du\, dw \\
&= (2\pi)^{m^2+m} \int |F(x)|^2\, dx.
\end{aligned}
\]

This completes the proof of Theorem 5.


6. The case where n is odd. Global characters of the representations $U_e$.


THEOREM 6. Let $e = (e_{jk}) \in E_m$; set $e_1 = e_{m,1}, \ldots, e_m = e_{1,m}$; suppose
$e_1 e_2 \cdots e_m \neq 0$.
(i) Let

\[
x = \begin{pmatrix} y & 0 & 0 \\ u & 1 & 0 \\ w & v & z \end{pmatrix} \mapsto F(x) = F_1(y, z, v, u, w)
\]

be an infinitely differentiable, rapidly decreasing function on $G_{2m+1}$. Then
the operator $U_e(F)$ is a trace-class operator.
(ii) There exists a tempered distribution $T_e$ on $M_m$ such that

\[
\operatorname{tr}(U_e(F)) = \int F_1(1, 1, 0, 0, w)\, dT_e(w).
\]

(iii) On the set $N_m$ of the $w \in M_m$ such that $\Delta_2(w)\Delta_3(w) \cdots \Delta_m(w) \neq 0$, $T_e$
coincides with the function





w— (2a) ere”. oS | 
exp ete) — eB +. + (re HO] 





|\Ao(w)As(w) ... An (w)| 
Proof. As in the proof of Theorem 5, we introduce
the function $(y, z, v, u, w) \mapsto \hat{F}(y, z, v, u, w)$ on $G_m \times G_m \times M_{m,1} \times M_{m,1} \times M_m$,
the Fourier transform of the function $(y, z, v, u, w) \mapsto F_1(y, z, v, u, w)$ with
respect to the variables $u$ and $w$. Formula (22) shows that $U_e(F)$ is defined
by the kernel

\[
K(y, z, v, y', z', v') = \hat{F}(y'^{-1}y, z'^{-1}z, z'^{-1}(v - v'), y^{-1}ev', y^{-1}ez').
\]


As in the proof of Theorem 3, we introduce the notations
$\|y\|$, $\|w\|$ for the elements of $G_m$, $M_m$, and we set $\|v\| = \sum_j |\tau_j|$ if
$v = (\tau_j) \in M_{m,1}$. We have seen that there exist constants $A_1 > 0$, $\delta_1 > 0$ such
that

\[
\|y'^{-1}y\| + \|z'^{-1}z\| + \|y^{-1}ez'\| \geq A_1\bigl(\|y\| + \|z\| + \|y'\| + \|z'\|\bigr)^{\delta_1};
\]

on the other hand, there exist constants $A_2 > 0$, $\delta_2 > 0$ such that

\[
\|v'\| = \|e^{-1}y(y^{-1}ev')\| \leq A_2\bigl(\|y\| + \|y^{-1}ev'\|\bigr)^{\delta_2}
\]

and constants $A_3 > 0$, $\delta_3 > 0$ such that

\[
\|v\| = \|z'(z'^{-1}(v - v')) + v'\| \leq A_3\bigl(\|z'\| + \|v'\| + \|z'^{-1}(v - v')\|\bigr)^{\delta_3}.
\]

Finally, there exist constants $A > 0$, $\delta > 0$ such that

\[
\|y'^{-1}y\| + \|z'^{-1}z\| + \|z'^{-1}(v - v')\| + \|y^{-1}ev'\| + \|y^{-1}ez'\|
\geq A\bigl(\|y\| + \|y'\| + \|z\| + \|z'\| + \|v\| + \|v'\|\bigr)^{\delta}.
\]

By Lemma 4, $K$ is therefore an infinitely differentiable, rapidly decreasing
function. Consequently, $U_e(F)$ is a trace-class operator, and

\[
\operatorname{tr}(U_e(F)) = \iiint \hat{F}(1, 1, 0, yev, yez)\, dy\, dz\, dv.
\]


The map $(y, z, v) \mapsto (yev, yez)$ of $G_m \times G_m \times M_{m,1}$ into $M_{m,1} \times M_m$
transforms the measure $dy\, dz\, dv$ into a tempered distribution on $M_{m,1} \times M_m$,
by Lemma 4 and the inequalities obtained above; let $T'_e$ be the Fourier transform
of this distribution, which is also a tempered distribution;
then

\[
\operatorname{tr}(U_e(F)) = \int F_1(1, 1, 0, u, w)\, dT'_e(u, w).
\]
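We have

\[
\hat{F}(1, 1, 0, yev, yez) = \iint F_1(1, 1, 0, u, w) \exp i \operatorname{tr}(yevu + yezw)\, du\, dw,
\]

\[
\begin{aligned}
\int \hat{F}(1, 1, 0, yev, yez)\, dv
&= \int dv \iint F_1(1, 1, 0, u, w) \exp i \operatorname{tr}(yevu + yezw)\, du\, dw \\
&= \int |\det e|^{-1}\, dv \iint F_1(1, 1, 0, u, w) \exp i \operatorname{tr}(vu + yezw)\, du\, dw \\
&= \int |\det e|^{-1}\, dv \iint F_1(1, 1, 0, u, we^{-1}y^{-1}) \exp i \operatorname{tr}(vu + zw)\, |\det e|^{-m}\, du\, dw \\
&= |\det e|^{-1-m} \int dv \iint F_1(1, 1, 0, u, we^{-1}y^{-1})\, (\exp i \operatorname{tr} w)\, \bigl(\exp i \operatorname{tr}(vu + z^0 w)\bigr)\, du\, dw
\end{aligned}
\]

on setting $z = 1 + z^0$. Hence

\[
\int \hat{F}(1, 1, 0, yev, yez)\, dv = (2\pi)^m |\det e|^{-1-m} \int F_1(1, 1, 0, 0, we^{-1}y^{-1})\, (\exp i \operatorname{tr} w)\, (\exp i \operatorname{tr} z^0 w)\, dw.
\]

Then, arguing as for Theorem 3,

\[
\iint \hat{F}(1, 1, 0, yev, yez)\, dz\, dv = (2\pi)^m |\det e|^{-1-m} (2\pi)^{m(m-1)/2} \int F_1(1, 1, 0, 0, te^{-1}y^{-1})\, (\exp i \operatorname{tr} t)\, dt
\]

where $t$ ranges over the set of matrices $(\tau_{jk})$ such that $\tau_{jk} = 0$ for $j < k$ and
where $dt$ is defined by $\prod_{j \geq k} d\tau_{jk}$. Set $te^{-1} = ze'$, where $z \in G_m$ and $e' = (e'_{jk}) \in E_m$,
and $e'_{m-j+1,j} = e'_j$. We obtain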


tr ¢ = tr(ze’e) = tr(e’e) = ) » Cn $41, € 3 
J 


= 2 
dt = |e,...€m||(€rem)” (€2x€m—1)” —- - - (€m—2€2) |e. . . de, dz 


| m m—1 1) e—1 p—2 r—m+1 I te 
mm 16:60 ..-Emii€g €3 «0 0 Cm de‘dz. 










Hence


ffFa, 1, 0, yev, yer) dadv 
= (20) eres? . a" | ff Fi(1, 1,0, 0, ze'y"*)(exp i = <¢ +1€)) 
J 


1 1 
lg. | de'ds. 


Suppose now that the support of the function $w \mapsto F_1(1, 1, 0, 0, w)$ is compact
and contained in $N_m$. We have
fff F(1, 1, 0, yev, yez) dy dz dv 


= (24) era? ea fff Fi, 1,0, 0, ze'y*)(exp i > én—s+1 €)) 
P| 


1 +1 
€ ...€ |dyde'dz 


= (24) ler "| f Fi(1, 1, 0, 0, w) 


‘ A,(w) 
. An(w) +...+ -1)", 22) | 
— piles a 42(w) 





[A2(w) .. . An (w) —_ 

Remarks. 1. Except in the case $m = 1$ (a case studied in (4)), one sees
that the distribution $T_e$ of the theorem is not a measure.

2. Let $G$ be a simply connected nilpotent Lie group. The following conjecture
seems plausible: the global characters of the irreducible unitary
representations of $G$ are tempered distributions on $G$ (the notion
of tempered distribution being defined by means of the exponential map).


Errata to an earlier article:*


(1) P. 322, l. 4, the conjecture that $\mathfrak{Z}(\mathfrak{g})$ is of finite type is false: this
can be seen by using the counter-example to Hilbert's 14th problem recently
obtained by Nagata.


(2) The Lie algebras defined on pp. 322-3 are, for the Lie groups
defined on pp. 330-1, the algebras of right-invariant vector fields. However,
in the preceding article of this series, left-invariant vector fields were
always used. It follows that $M_1$ must be changed to $-M_1$ and
$M_2$ to $-M_2$ in formulas (9), (12), (15), (18), (21), (24), (27), (30),
(33) and in their proofs.


(3) In formula (25), the last 4 terms of the bracket should be:
— (dr? + pu”) ps0 — upp — 4 du(pi1 — p2)0 — 4(A* + wu) (ups — Ap2)O”. 


P. 341, l. 17, the last term should be: $4\lambda\mu(p_1^2 - p_2^2)$. In formula
(33), one needs:


Uy(x3) = — iAM2 + 31M} Uy(x4) = — idkM, 


*Sur les représentations unitaires des groupes de Lie nilpotents. III, Can. J. Math., 10 (1958),
pp. 321-48.











In formula (34), one needs


, 2 = 
exp in( — pis + § pe: + 3 poi — 6 Pe — po + pists) : 


BIBLIOGRAPHY


1. F. Bruhat, Sur les représentations induites des groupes de Lie, Bull. Soc. Math. France, 84
(1956), 97-205.

2. J. Dixmier, Sur les représentations unitaires des groupes de Lie nilpotents, II, Bull. Soc.
Math. France, 85 (1957), 325-88.

3. I. M. Gelfand and M. A. Neumark, Unitäre Darstellungen der klassischen Gruppen (Berlin,
1957).

4. R. Godement, Mémoire sur la théorie des caractères dans les groupes localement compacts
unimodulaires, J. Math. pures et appl., 30 (1951), 1-110.

5. R. Godement, Théorie des caractères, II. Définition et propriétés générales des caractères, Ann.
Math., 59 (1954), 63-85.

6. A. Grothendieck, Produits tensoriels topologiques et espaces nucléaires, Mem. Amer. Math.
Soc., 16 (1955).

7. G. W. Mackey, Imprimitivity for representations of locally compact groups, I, Proc. Nat.
Acad. Sc. U.S.A., 35 (1949), 537-45.

8. L. Schwartz, Théorie des distributions, II (Paris, Hermann, Act. Sc. et Ind., 1122 (1951)).





Institut Henri Poincaré 













NOTE ON GENERALIZED WITT ALGEBRAS 
RIMHAK REE 


Introduction. Throughout this note $K$ will denote a field of characteristic
$p > 0$. Let $I$ be the set $\{1, 2, \ldots, m\}$, and $\mathfrak{G}$ a finite additive group of functions
on $I$ with values in $K$. We assume that $\mathfrak{G}$ is total in the sense that, for any
$\lambda_1, \ldots, \lambda_m$ in $K$, $\sum_{i=1}^{m} \lambda_i \sigma(i) = 0$ for all $\sigma$ in $\mathfrak{G}$ implies all $\lambda_i = 0$. It is clear
that $\mathfrak{G}$ is an elementary $p$-group. Let $p^n$ be the order of $\mathfrak{G}$. A generalized
Witt algebra $\mathfrak{L}$ is defined as an algebra over $K$ with basis elements
$\{e(\sigma, i) \mid \sigma \in \mathfrak{G}, i \in I\}$ and the multiplication table


(0.0.1) $e(\sigma, i)e(\tau, j) = \tau(i)e(\sigma + \tau, j) - \sigma(j)e(\sigma + \tau, i)$.


$\mathfrak{L}$ is a simple Lie algebra except when $p = 2$, $m = 1$.
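The multiplication table (0.0.1) can be tested on a small case. The sketch below is only an illustration under stated assumptions: it takes $p = 3$, $m = 2$, lets $\mathfrak{G}$ be the full group of functions from $I$ to the prime field (which is total), indexes $I$ from 0, and stores an algebra element as a dictionary of coefficients. The function names are ours, not the paper's. It checks anticommutativity and the Jacobi identity on basis elements.

```python
from itertools import product

P = 3   # characteristic p (illustrative choice)
M = 2   # size of the index set I, written 0-based here

# Assumption: G is the full group of functions I -> F_p, stored as tuples.
GROUP = list(product(range(P), repeat=M))

def basis_product(s, i, t, j):
    """Return e(s,i)e(t,j) per (0.0.1) as {(sigma, index): coefficient mod P}."""
    st = tuple((a + b) % P for a, b in zip(s, t))
    out = {}
    for key, coeff in (((st, j), t[i]), ((st, i), -s[j])):
        out[key] = (out.get(key, 0) + coeff) % P
    return {k: c for k, c in out.items() if c}

def multiply(x, y):
    """Bilinear extension of the basis product to arbitrary elements."""
    out = {}
    for (s, i), a in x.items():
        for (t, j), b in y.items():
            for key, c in basis_product(s, i, t, j).items():
                out[key] = (out.get(key, 0) + a * b * c) % P
    return {k: c for k, c in out.items() if c}

def plus(x, y):
    """Sum of two elements, dropping zero coefficients."""
    out = dict(x)
    for k, c in y.items():
        out[k] = (out.get(k, 0) + c) % P
    return {k: c for k, c in out.items() if c}

BASIS = [{(s, i): 1} for s in GROUP for i in range(M)]

def is_antisymmetric():
    """Check ab + ba = 0 on all pairs of basis elements."""
    return all(plus(multiply(a, b), multiply(b, a)) == {}
               for a in BASIS for b in BASIS)

def satisfies_jacobi():
    """Check (ab)c + (bc)a + (ca)b = 0 on all triples of basis elements."""
    for a in BASIS:
        for b in BASIS:
            for c in BASIS:
                cyclic = plus(plus(multiply(multiply(a, b), c),
                                   multiply(multiply(b, c), a)),
                              multiply(multiply(c, a), b))
                if cyclic:
                    return False
    return True
```

Because the product is bilinear, checking the two identities on basis elements is enough for the whole algebra.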

In the first section of this note we shall prove that the outer derivation
algebra of a generalized Witt algebra is abelian, assuming that $K$ is infinite.
We shall see that actually a result of Jacobson (3) is generalized.

It was shown in (5) that any generalized Witt algebra $\mathfrak{L}$ can be reformulated
as follows: Let $\mathfrak{A}$ be a commutative associative algebra over $K$ with a
unity element, and $D_1, \ldots, D_m$ be derivations of $\mathfrak{A}$ such that:

(1) $[D_i, D_j] = D_i D_j - D_j D_i = 0$ for all $i$ and $j$;

(2) If $f \in \mathfrak{A}$ and $\lambda_1, \ldots, \lambda_m$ in $K$ are such that $D_i f = \lambda_i f$ for all $i$ then
$f = 0$ or $f$ is a unit in $\mathfrak{A}$;

(3) $\sum_{i=1}^{m} f_i D_i = 0$, where $f_i \in \mathfrak{A}$, implies $f_i = 0$ for all $i$.

Now any generalized Witt algebra can be regarded as the subalgebra
$\mathfrak{L}(\mathfrak{A}; D_1, \ldots, D_m)$ of the derivation algebra of $\mathfrak{A}$ consisting of all derivations
of the form $f_1 D_1 + \ldots + f_m D_m$. In the second section of this note we shall
consider $\mathfrak{L}(\mathfrak{A}; D_1, \ldots, D_m)$ under the conditions (1) and (2) above only, and
extend some results proved in (5).


1. The derivation algebra of a generalized Witt algebra. We prove
the following


THEOREM 1.1. Let $\mathfrak{L}$ be a generalized Witt algebra over an infinite field $K$ of
characteristic $p > 2$. Let $\{e(\sigma, i) \mid \sigma \in \mathfrak{G}, i \in I\}$ be a basis of $\mathfrak{L}$. Then any
derivation of $\mathfrak{L}$ is the sum of an inner derivation and a derivation $\delta_\varphi$ given by
(1.1.1) $\delta_\varphi(e(\sigma, i)) = \varphi(\sigma)e(\sigma, i)$
where $\varphi$ is a linear map of $\mathfrak{G}$ into $K$.
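That a map of the form (1.1.1) is a derivation exactly when $\varphi$ respects addition can be seen on a small example. The sketch below makes the same illustrative assumptions as before ($p = 3$, $m = 2$, $\mathfrak{G}$ the full group of functions from $I$ to the prime field, 0-based indices, hypothetical function names). Over the prime field every additive $\varphi$ has the form $\sum_j \alpha_j \sigma(j)$, so this only illustrates the derivation property and does not produce an outer derivation.

```python
from itertools import product

P = 3   # characteristic p (illustrative choice)
M = 2   # size of the index set I, written 0-based here
GROUP = list(product(range(P), repeat=M))   # assumed: all functions I -> F_p

def basis_product(s, i, t, j):
    """Return e(s,i)e(t,j) per (0.0.1) as {(sigma, index): coefficient mod P}."""
    st = tuple((a + b) % P for a, b in zip(s, t))
    out = {}
    for key, coeff in (((st, j), t[i]), ((st, i), -s[j])):
        out[key] = (out.get(key, 0) + coeff) % P
    return {k: c for k, c in out.items() if c}

def multiply(x, y):
    """Bilinear extension of the basis product."""
    out = {}
    for (s, i), a in x.items():
        for (t, j), b in y.items():
            for key, c in basis_product(s, i, t, j).items():
                out[key] = (out.get(key, 0) + a * b * c) % P
    return {k: c for k, c in out.items() if c}

def plus(x, y):
    out = dict(x)
    for k, c in y.items():
        out[k] = (out.get(k, 0) + c) % P
    return {k: c for k, c in out.items() if c}

def delta(x, phi):
    """Apply e(sigma, i) -> phi(sigma) e(sigma, i) linearly, as in (1.1.1)."""
    out = {}
    for (s, i), c in x.items():
        value = (phi(s) * c) % P
        if value:
            out[(s, i)] = value
    return out

BASIS = [{(s, i): 1} for s in GROUP for i in range(M)]

def is_derivation(phi):
    """Check delta(ab) = delta(a)b + a delta(b) on all pairs of basis elements."""
    for a in BASIS:
        for b in BASIS:
            left = delta(multiply(a, b), phi)
            right = plus(multiply(delta(a, phi), b), multiply(a, delta(b, phi)))
            if left != right:
                return False
    return True

def additive_phi(s):
    """An additive map G -> F_p (coefficients 1 and 2 chosen arbitrarily)."""
    return (s[0] + 2 * s[1]) % P

def non_additive_phi(s):
    """A map that is not additive: the square of the first coordinate."""
    return (s[0] ** 2) % P
```

With these definitions `is_derivation(additive_phi)` holds while `is_derivation(non_additive_phi)` fails, since on a nonzero product $e(\sigma, i)e(\tau, j)$ the Leibniz rule reduces to $\varphi(\sigma + \tau) = \varphi(\sigma) + \varphi(\tau)$.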

Received May 12, 1958. This research was supported by the United States Air Force 
through the Air Research and Development Command under Contract No. AF 49(638)-152. 













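Proof. First of all we show that we may assume (1.1.2): for any $i$, $1 \leq i \leq m$,
$\sigma(i) = 0$ implies $\sigma = 0$. Suppose (1.1.2) is not satisfied. Since $K$ is infinite
and $\mathfrak{G}$ total, we may proceed as in the proof of Lemma 9.1 of (5, p. 533)
to obtain an $m \times m$ non-singular matrix $(\beta_{ij})$ such that if we define $\sigma[i]$ by

\[
\sigma[i] = \sum_{j} \beta_{ij}\, \sigma(j), \qquad (i = 1, \ldots, m),
\]

then, for any $i$, $\sigma[i] = 0$ implies $\sigma = 0$. Define a new basis $\{e[\sigma, i] \mid \sigma \in \mathfrak{G}, i \in I\}$
of $\mathfrak{L}$ by

\[
e[\sigma, i] = \sum_{j=1}^{m} \beta_{ij}\, e(\sigma, j).
\]

Then by (0.0.1) we have

\[
\begin{aligned}
e[\sigma, i]\, e[\tau, j] &= \sum_{s,t} \beta_{is}\beta_{jt}\, e(\sigma, s)e(\tau, t) \\
&= \sum_{s,t} \beta_{is}\beta_{jt}\, \bigl(\tau(s)e(\sigma + \tau, t) - \sigma(t)e(\sigma + \tau, s)\bigr) \\
&= \tau[i]\, e[\sigma + \tau, j] - \sigma[j]\, e[\sigma + \tau, i].
\end{aligned}
\]

Thus $\{e[\sigma, i]\}$ satisfies the same multiplication table as $\{e(\sigma, i)\}$ with $\sigma(i)$
replaced by $\sigma[i]$. But here $\sigma[i] = 0$ implies $\sigma = 0$. Suppose that the given
derivation is the sum of an inner derivation and a derivation $\delta_\varphi$ given by
$\delta_\varphi(e[\sigma, i]) = \varphi(\sigma)e[\sigma, i]$, where $\varphi$ is an additive map of $\mathfrak{G}$ into $K$. Then clearly
we have $\delta_\varphi(e(\sigma, i)) = \varphi(\sigma)e(\sigma, i)$ also. This shows that we can assume (1.1.2)
from the beginning.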

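Now let $\delta$ be the given derivation, and let

\[
\delta(e(\sigma, i)) = \sum_{\tau, j} \gamma(\sigma, i; \tau, j)\, e(\sigma + \tau, j)
\]

with coefficients $\gamma(\sigma, i; \tau, j)$ in $K$. Then from

\[
\delta(e(0, i))e(\sigma, i) + e(0, i)\delta(e(\sigma, i)) = \sigma(i)\delta(e(\sigma, i))
\]

we obtain

(1.1.3) $\gamma(\sigma, i; \tau, j) = \gamma(0, i; \tau, j)\tau(i)\tau(i)^{-1} = \gamma(0, i; \tau, j)$

for $i \neq j$ and $\tau \neq 0$, and

(1.1.4) $\sum_{j} \gamma(0, i; \tau, j)\sigma(j) + \gamma(\sigma, i; \tau, i)\tau(i) = \gamma(0, i; \tau, i)\tau(i)$.

By (1.1.3) and (1.1.4) we see easily that

\[
\delta(e(\sigma, i)) = \sum_{j} \gamma(\sigma, i; 0, j)\, e(\sigma, j)
+ e(\sigma, i) \sum_{\tau \neq 0} \sum_{j} \gamma(0, i; \tau, j)\tau(i)^{-1} e(\tau, j).
\]

Hence $\delta$ is the sum of an inner derivation and a derivation $\delta_1$ of the form

(1.1.5) $\delta_1(e(\sigma, i)) = \sum_{j} \gamma(\sigma, i, j)\, e(\sigma, j)$

with coefficients $\gamma(\sigma, i, j)$ in $K$.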














We shall show that $\gamma(\sigma, i, j) = 0$ if $i \neq j$, that $\gamma(\sigma, 1, 1) = \ldots = \gamma(\sigma, m, m)$,
and that $\gamma(\sigma, 1, 1)$ is additive with respect to $\sigma$. If $m = 1$, then the additivity
of $\gamma(\sigma, 1, 1)$ follows immediately from

\[
\delta_1(e(\sigma, 1))e(\tau, 1) + e(\sigma, 1)\delta_1(e(\tau, 1)) = \delta_1(e(\sigma, 1)e(\tau, 1)).
\]

Hence we shall assume that $m > 1$. Then from

\[
\delta_1(e(\sigma, i))e(\tau, j) + e(\sigma, i)\delta_1(e(\tau, j)) = \delta_1(e(\sigma, i)e(\tau, j))
\]

we have, for $i \neq j$,

(1.1.6) $\gamma(\sigma, i, j)\sigma(i) - \gamma(\tau, i, j)\tau(i) = \gamma(\sigma + \tau, i, j)(\sigma(i) - \tau(i))$;

(1.1.7) $\sum_{k} \gamma(\sigma, i, k)\tau(k) = \gamma(\sigma, i, j)\sigma(j) - \gamma(\tau, j, j)\tau(i) + \gamma(\sigma + \tau, j, j)\tau(i) - \gamma(\sigma + \tau, i, j)\sigma(j)$.

Setting $\sigma = 0$ in (1.1.7) and using the fact that $\mathfrak{G}$ is total, we have

(1.1.8) $\gamma(0, i, k) = 0$

for all $i$ and $k$. Set $\tau = -\sigma$ in (1.1.6) and use (1.1.8). Then we have, for
any $\sigma$ and $i \neq j$,

(1.1.9) $\gamma(\sigma, i, j) + \gamma(-\sigma, i, j) = 0$.

Replace $\tau$ in (1.1.6) by $-\tau$, and use (1.1.9). Then we have

\[
\gamma(\sigma, i, j)\sigma(i) - \gamma(\tau, i, j)\tau(i) = \gamma(\sigma - \tau, i, j)(\sigma(i) + \tau(i)).
\]

Combining this with (1.1.6) yields

(1.1.10) $\gamma(\sigma - \tau, i, j)(\sigma(i) + \tau(i)) = \gamma(\sigma + \tau, i, j)(\sigma(i) - \tau(i))$.

Since $\mathfrak{G}$ is an elementary $p$-group and $p \neq 2$, $\sigma - \tau$ and $\sigma + \tau$ may be regarded
as two arbitrary elements in $\mathfrak{G}$. Hence by (1.1.10) it follows that, for $i \neq j$,

(1.1.11) $\gamma(\sigma, i, j) = \alpha_{ij}\sigma(i)$,

where $\alpha_{ij}$ are in $K$ and independent of $\sigma$. Substituting this in (1.1.7) we
obtain

(1.1.12) $\gamma(\sigma, i, i)\tau(i) + \sum_{k \neq i} \alpha_{ik}\sigma(i)\tau(k) = \gamma(\sigma + \tau, j, j)\tau(i) - \gamma(\tau, j, j)\tau(i) - \alpha_{ij}\tau(i)\sigma(j)$,

which shows that $(\gamma(\sigma + \tau, j, j) - \gamma(\tau, j, j))\tau(i)$ is additive with respect to
$\tau$. Hence

(1.1.13) $\gamma(\sigma + \tau, j, j) - \gamma(\tau, j, j) = \gamma(\sigma - \tau, j, j) - \gamma(-\tau, j, j)$

for all $\sigma$ and $\tau$. Let $\sigma = \tau$ in the above and use (1.1.8). Then

(1.1.14) $\gamma(2\tau, j, j) - \gamma(\tau, j, j) = -\gamma(-\tau, j, j)$.

By (1.1.13) and (1.1.14) we have

\[
\gamma(\sigma + \tau, j, j) = \gamma(\sigma - \tau, j, j) + \gamma(2\tau, j, j)
\]

which shows that $\gamma(\sigma, j, j)$ is additive with regard to $\sigma$, since, as before,
$\sigma + \tau$ and $\sigma - \tau$ can be regarded as two arbitrary elements in $\mathfrak{G}$. Now from
(1.1.12) we obtain

\[
\gamma(\sigma, i, i)\tau(i) + \sum_{k \neq i} \alpha_{ik}\sigma(i)\tau(k) = \gamma(\sigma, j, j)\tau(i) - \alpha_{ij}\tau(i)\sigma(j)
\]

for all $\sigma$ and $\tau$. Using the fact that $\mathfrak{G}$ is total, we see from the above that
$\alpha_{ik} = 0$ for $k \neq i$ and that $\gamma(\sigma, i, i) = \gamma(\sigma, j, j)$ for any $i$ and $j$. Set $\gamma(\sigma, i, i)
= \varphi(\sigma)$. Then $\varphi$ is additive, and we have (1.1.1) as desired. Thus Theorem
1.1 is proved.

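When is the derivation $\delta$ defined by $\delta(e(\sigma, i)) = \varphi(\sigma)e(\sigma, i)$, where $\varphi$ is an
additive function on $\mathfrak{G}$, inner? Let

\[
\delta(e(\sigma, i)) = e(\sigma, i) \sum_{\tau, j} \alpha_{\tau,j}\, e(\tau, j)
\]

with $\alpha_{\tau,j} \in K$. Then

\[
0 = \delta(e(0, i)) = \sum_{\tau, j} \alpha_{\tau,j}\, \tau(i)\, e(\tau, j).
\]

Hence $\tau(i) = 0$, $\tau = 0$, whenever $\alpha_{\tau,j} \neq 0$. From this it follows that $\delta$ is inner
if, and only if, $\varphi(\sigma) = \sum_j \alpha_j \sigma(j)$ with $\alpha_j \in K$. Such additive functions $\varphi$
clearly form an $m$-dimensional vector space over $K$. On the other hand, if $\mathfrak{G}$ is an
elementary group of order $p^n$, then all the additive functions on $\mathfrak{G}$ with values
in $K$ form an $n$-dimensional vector space over $K$. Hence we have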


COROLLARY 1.2. Let $\mathfrak{L}$ be a generalized Witt algebra with basis $\{e(\sigma, i) \mid \sigma \in \mathfrak{G},
i \in I\}$, where $\mathfrak{G}$ is an elementary $p$-group of order $p^n$, and $I = \{1, 2, \ldots, m\}$.
Let $\mathfrak{D}$ and $\mathfrak{I}$ be the derivation algebra and the algebra of inner derivations of $\mathfrak{L}$,
respectively. Then $\mathfrak{D}/\mathfrak{I}$ is an abelian algebra of dimension $n - m$, provided
that the characteristic of $K$ is greater than 2.


From the above corollary it follows immediately that the number $m$ is
uniquely determined by $\mathfrak{L}$. This is, however, proved in (5, p. 546). Also, if
$m = n$, then every derivation of $\mathfrak{L}$ is inner. This is a result of Jacobson (3).


2. Generalized orthogonal systems. Let $\mathfrak{A}$ be a finite-dimensional
commutative associative algebra over the algebraically closed ground field
$K$. We assume that $\mathfrak{A}$ has a unity element.

An ordered set $(D_1, \ldots, D_m)$ of derivations of $\mathfrak{A}$ will be called a generalized
orthogonal (g.o.) system if the following conditions (2.1.1)-(2.1.2) are satisfied:


(2.1.1) $[D_i, D_j] = D_i D_j - D_j D_i = 0$ for all $i$ and $j$;


(2.1.2) If $f \in \mathfrak{A}$ and $\lambda_1, \ldots, \lambda_m \in K$ are such that $D_i f = \lambda_i f$ for all $i$, then
$f = 0$ or $f$ is a unit of $\mathfrak{A}$.











A g.o. system $(D_1, \ldots, D_m)$ will be called an o. system if it satisfies the
following condition:
(2.1.3) $\sum_{i=1}^{m} f_i D_i = 0$, where $f_i \in \mathfrak{A}$, implies $f_i = 0$ for all $i$.

LEMMA 2.1. The conditions (2.1.1)-(2.1.2) imply the following:
(2.1.4) $D_i f = 0$ for all $i = 1, \ldots, m$ implies $f \in K$.


Proof. The set $\mathfrak{B}$ of all $f \in \mathfrak{A}$ such that $D_i f = 0$ for all $i$ is clearly a subalgebra
of $\mathfrak{A}$, and, moreover, if $0 \neq f \in \mathfrak{B}$ then by (2.1.2) $f^{-1}$ exists and
belongs to $\mathfrak{B}$, since $D_i f^{-1} = -f^{-2} D_i f = 0$. Therefore, $\mathfrak{B}$ is a finite extension
field of $K$. Since $K$ is algebraically closed, we have $\mathfrak{B} = K$.


THEOREM 2.2. For any g.o. system $(D_1, \ldots, D_m)$ there exists a non-void
subset $S = \{i_1, \ldots, i_r\}$ of indices $1, \ldots, m$ such that (2.2.1)-(2.2.2), below,
hold:


(2.2.1) $(D_{i_1}, \ldots, D_{i_r})$ is an o. system;

(2.2.2) There exist $\alpha_{is} \in K$ such that

\[
D_i = \sum_{s \in S} \alpha_{is} D_s, \qquad (i = 1, \ldots, m).
\]
seS8 
Proof. Let $S$ be a minimal subset of the indices $1, \ldots, m$ with respect to
the property: there exist $a_{is} \in \mathfrak{A}$ such that

(2.2.3) $D_i = \sum_{s \in S} a_{is} D_s$ $(i = 1, \ldots, m)$.

We may assume without loss of generality that $S = \{1, \ldots, r\}$. Let $V$ be the
set of all $r$-tuples $(f_1, \ldots, f_r)$ of elements $f_s \in \mathfrak{A}$ such that $\sum_s f_s D_s = 0$. Define
addition in $V$ componentwise, scalar multiplication by $\alpha(f_1, \ldots, f_r) =
(\alpha f_1, \ldots, \alpha f_r)$, $\alpha \in K$. Then $V$ is a finite-dimensional vector space over $K$. We
shall prove (2.1.3) for $(D_1, \ldots, D_r)$ by showing that $V = 0$. Suppose $V \neq 0$.
Since $\sum_s f_s D_s = 0$ implies $\sum_s (D_i f_s) D_s = 0$, the mapping $(f_1, \ldots, f_r) \to
(D_i f_1, \ldots, D_i f_r)$ is a linear transformation of $V$. Since $D_i(D_j f) = D_j(D_i f)$ for
all $f \in \mathfrak{A}$, $i$, and $j$, and since $K$ is algebraically closed, there exist a non-zero
$(f_1, \ldots, f_r) \in V$ and $\lambda_1, \ldots, \lambda_m \in K$ such that

\[
(D_i f_1, \ldots, D_i f_r) = \lambda_i (f_1, \ldots, f_r)
\]

for $i = 1, \ldots, m$. Then $D_i f_s = \lambda_i f_s$ for all $i$ and $s$. Then from (2.1.2) it
follows that $f_s$ is either 0 or a unit in $\mathfrak{A}$. Since not all $f_s$ are zero, we may assume
$f_1 \neq 0$; $f_1$ is a unit. Then $D_1 = -f_1^{-1} f_2 D_2 - \ldots - f_1^{-1} f_r D_r$. Then every $D_i$
can be written as a linear combination of $D_2, \ldots, D_r$ with coefficients in $\mathfrak{A}$.
This contradicts the minimality of $S$. Thus $V = 0$, and hence (2.1.3) is proved
for $(D_1, \ldots, D_r)$.

Now, from (2.1.1) and (2.2.3) it follows that $\sum_s (D_k a_{is}) D_s = 0$ for all
$i, k = 1, \ldots, m$. Therefore by (2.2.1), we have $D_k a_{is} = 0$ and hence, by
Lemma 2.1, $a_{is} = \alpha_{is} \in K$ for all $i$ and $s$. This proves (2.2.2).











In order to show that $(D_1, \ldots, D_r)$ is an o. system, it remains to be shown
that $D_s f = \lambda_s f$, $\lambda_s \in K$, for $s = 1, \ldots, r$ implies that $f = 0$ or $f$ is a unit.
This, however, follows easily from (2.2.2) and (2.1.2). Thus the proof of
Theorem 2.2 is complete.


COROLLARY 2.3. A g.o. system $(D_1, \ldots, D_m)$ is an o. system if, and only if,
$D_1, \ldots, D_m$ are linearly independent over $K$.


COROLLARY 2.4. If there exists a g.o. system of derivations of $\mathfrak{A}$, then $\mathfrak{A}$ is
isomorphic to the group algebra over $K$ of an abelian $p$-group of type $(p, p, \ldots, p)$.


Proof. By Theorem 2.2, there exists an o. system of derivations of $\mathfrak{A}$. Then
Corollary 2.4 follows from Lemma 2.1 above and Theorem 6.10 of (5).


COROLLARY 2.5. The conditions (2.1.1)-(2.1.2) imply the following: If
$f \in \mathfrak{A}$ and $a_1, \ldots, a_m \in \mathfrak{A}$ are such that $D_i f = a_i f$ for all $i$, then $f = 0$ or $f$ is a unit.


Proof. Corollary 2.5 follows immediately from Theorem 2.2 above, and
Lemma 6.3 of (5).


The following theorem, which also follows immediately from Theorem 2.2,
above, and Theorem 6.10 of (5), is a partial generalization of Theorem 6.10
of (5).


THEOREM 2.6. If $(D_1, \ldots, D_m)$ is a g.o. system, then the subalgebra of the
derivation algebra of $\mathfrak{A}$, consisting of all derivations of the form $\sum f_i D_i$, where
$f_i \in \mathfrak{A}$, is isomorphic to a generalized Witt algebra.


Now let $(D_0, \ldots, D_m)$ be a set of derivations of $\mathfrak{A}$, satisfying (2.1.1), and
let $a_0, \ldots, a_m \in \mathfrak{A}$ be such that $D_i a_j = D_j a_i$ for all $i$ and $j$. Then the set
$\mathfrak{L} = \mathfrak{L}(D_0, \ldots, D_m; a_0, \ldots, a_m)$ of all derivations of the form $\sum f_i D_i$, where
$f_i \in \mathfrak{A}$ satisfy $\sum (D_i f_i - a_i f_i) = 0$, forms a subalgebra of the derivation
algebra of $\mathfrak{A}$. A special case of such algebras was considered for the first time
by Frank (2), and another by Albert and Frank (1). The general case where
$(D_0, \ldots, D_m)$ is an o. system was considered by Jennings and Ree (4). Here
we consider the case where $(D_0, \ldots, D_m)$ is an arbitrary g.o. system.


THEOREM 2.7. If $(D_0, \ldots, D_m)$ is a g.o. system, then the algebra $\mathfrak{L}(D_0, \ldots, D_m;
a_0, \ldots, a_m)$ is isomorphic either to a generalized Witt algebra or to an algebra
of the form $\mathfrak{L}(D_0', \ldots, D_r'; a_0', \ldots, a_r')$, where $(D_0', \ldots, D_r')$ is an o. system.


Proof. If $m = 0$, then $(D_0, \ldots, D_m)$ is an o. system, and so our theorem
is clear. We shall proceed by induction on $m$. Assume that Theorem 2.7 is
true for $m - 1$. If $(D_0, \ldots, D_m)$ is an o. system then our theorem is clear. If
$(D_0, \ldots, D_m)$ is not an o. system, then, by Theorem 2.2, we may assume
without loss of generality that $D_m = \alpha_0 D_0 + \ldots + \alpha_{m-1} D_{m-1}$ with $\alpha_i \in K$.
We have








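\[
D_k\Bigl(a_m - \sum_{i=0}^{m-1} \alpha_i a_i\Bigr) = D_m a_k - \sum_{i=0}^{m-1} \alpha_i D_i a_k = 0
\]

for $k = 0, 1, \ldots, m$. Hence

\[
a_m - \sum_{i=0}^{m-1} \alpha_i a_i = \alpha
\]

belongs to $K$ by Lemma 2.1.

If $\alpha = 0$ then $\mathfrak{L} = \mathfrak{L}(D_0, \ldots, D_m; a_0, \ldots, a_m)$ and $\mathfrak{L}_1 = \mathfrak{L}(D_0, \ldots, D_{m-1};
a_0, \ldots, a_{m-1})$ coincide. This is seen as follows: Let $\sum_0^m f_i D_i \in \mathfrak{L}$. Then by
definition, $\sum_0^m (D_i f_i - a_i f_i) = 0$, and hence

\[
\sum_{i=0}^{m-1} \bigl(D_i(f_i + \alpha_i f_m) - a_i(f_i + \alpha_i f_m)\bigr) = 0.
\]

On the other hand,

\[
\sum_{i=0}^{m} f_i D_i = \sum_{i=0}^{m-1} (f_i + \alpha_i f_m) D_i.
\]

Therefore, $\sum_0^m f_i D_i \in \mathfrak{L}_1$ and hence $\mathfrak{L} \subseteq \mathfrak{L}_1$ is proved. Since $\mathfrak{L}_1 \subseteq \mathfrak{L}$ is clear,
we have $\mathfrak{L} = \mathfrak{L}_1$.

If $\alpha \neq 0$ then $\mathfrak{L} = \mathfrak{L}(D_0, \ldots, D_m; a_0, \ldots, a_m)$ coincides with the set $\mathfrak{L}_2$
of all derivations of the form $\sum_0^{m-1} g_i D_i$, where $g_i$ runs over $\mathfrak{A}$. This is seen
as follows: Clearly we have $\mathfrak{L} \subseteq \mathfrak{L}_2$. Now, for an arbitrary element $\sum_0^{m-1} g_i D_i$
in $\mathfrak{L}_2$, define $f_0, f_1, \ldots, f_m$ by the formulae:

\[
f_m = \alpha^{-1} \sum_{i=0}^{m-1} (D_i g_i - a_i g_i); \qquad
f_i = g_i - \alpha_i f_m, \quad (0 \leq i < m).
\]

Then it is easily seen that $\sum_0^{m-1} g_i D_i = \sum_0^m f_i D_i$, and that

\[
\sum_{i=0}^{m} (D_i f_i - a_i f_i) = 0.
\]

Therefore $\sum_0^{m-1} g_i D_i \in \mathfrak{L}$, and hence $\mathfrak{L}_2 \subseteq \mathfrak{L}$ is proved. Thus we have $\mathfrak{L} = \mathfrak{L}_2$.
Since $\mathfrak{L}_2$ is a generalized Witt algebra, this completes the proof of Theorem 2.7.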

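Consider now a set of derivations $(D_1, \ldots, D_m)$ of $\mathfrak{A}$ satisfying only the
condition (2.1.1) and denote by $\mathfrak{L}$ the subalgebra of the derivation algebra
of $\mathfrak{A}$ consisting of all derivations of the form $\sum f_i D_i$, where $f_i \in \mathfrak{A}$. Let $\mathfrak{N}$ be the
radical of $\mathfrak{A}$, and let $\mathfrak{O}$ be the set of all $f \in \mathfrak{N}$ such that $D_i(D_j(\ldots(D_k f)\ldots))
\in \mathfrak{N}$ for any $i, j, \ldots, k$ (the number of indices $i, j, \ldots, k$ is arbitrary). It is
easily seen that $\mathfrak{O}$ is an ideal of $\mathfrak{A}$ and that $f \in \mathfrak{O}$ implies $D_i f \in \mathfrak{O}$ for all $i$.
Therefore every $D_i$ induces a derivation $\bar{D}_i$ of the algebra $\bar{\mathfrak{A}} = \mathfrak{A}/\mathfrak{O}$. Since
$[\bar{D}_i, \bar{D}_j] = 0$ follows from $[D_i, D_j] = 0$, we can consider the subalgebra $\bar{\mathfrak{L}}$ of
the derivation algebra of $\bar{\mathfrak{A}}$ consisting of all derivations of the form $\sum \bar{f}_i \bar{D}_i$,
where $\bar{f}_i \in \bar{\mathfrak{A}}$. Denote by $\bar{f}$ the image of $f \in \mathfrak{A}$ under the natural homomorphism
$\mathfrak{A} \to \bar{\mathfrak{A}}$. Since $\sum f_i D_i = 0$ implies $\sum \bar{f}_i \bar{D}_i = 0$, a mapping $\varphi$ is uniquely
defined by $\varphi(\sum f_i D_i) = \sum \bar{f}_i \bar{D}_i$. It is easily seen that $\varphi$ is a homomorphism
of $\mathfrak{L}$ onto $\bar{\mathfrak{L}}$. The kernel $\mathfrak{S}$ of $\varphi$ consists of elements $\sum f_i D_i$ such that $\sum \bar{f}_i \bar{D}_i = 0$.
Note that $\sum \bar{f}_i \bar{D}_i = 0$ if and only if $\sum f_i (D_i g) \in \mathfrak{O}$ for all $g \in \mathfrak{A}$. From this
it follows immediately that the ideal $[\mathfrak{S}, \mathfrak{S}]$ of $\mathfrak{L}$ is contained in the algebra $\mathfrak{S}_1$
consisting of all derivations of the form $\sum f_i D_i$, where $f_i \in \mathfrak{O}$. For a positive
integer $k$, denote by $\mathfrak{S}_k$ the algebra of all derivations of the form $\sum f_i D_i$, where
$f_i \in \mathfrak{O}^k$. It is easily seen that $[\mathfrak{S}_k, \mathfrak{S}_1] \subseteq \mathfrak{S}_{k+1}$ for any $k$. Since $\mathfrak{O} \subseteq \mathfrak{N}$, it follows
that $\mathfrak{O}$ is nilpotent, say, $\mathfrak{O}^t = 0$. Then $\mathfrak{S}_t = 0$, and hence $\mathfrak{S}_1$ is nilpotent, and
$\mathfrak{S}$ is solvable.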

Consider now the algebra $\bar{\mathfrak{L}}$, assuming that every non-unit element in $\mathfrak{A}$ is
contained in the radical $\mathfrak{N}$. We shall prove that $(\bar{D}_1, \ldots, \bar{D}_m)$ is a g.o. system
of $\bar{\mathfrak{A}}$. Suppose that $\bar{D}_i \bar{f} = \lambda_i \bar{f}$ for all $i$, and that $\bar{f}$ is a non-unit in $\bar{\mathfrak{A}}$. Then
$D_i f = \lambda_i f + g_i$, where $g_i \in \mathfrak{O}$. Since $\bar{f}$ is not a unit, $f$ is also not a unit, and
hence by our assumption $f \in \mathfrak{N}$. Then from $D_i f - \lambda_i f \in \mathfrak{O}$ it follows easily
that $f \in \mathfrak{O}$. Therefore $\bar{f} = 0$, and hence $(\bar{D}_1, \ldots, \bar{D}_m)$ is proved to be a g.o.
system. Then, by Theorem 2.6, $\bar{\mathfrak{L}}$ is isomorphic to a generalized Witt algebra.

An associative algebra $\mathfrak{A}$ is called completely primary if the set of non-unit
elements coincides with the radical of $\mathfrak{A}$. Summarizing the above, we have


THEOREM 2.8. Suppose that the commutative associative algebra $\mathfrak{A}$ is completely
primary. Then for any set of derivations $(D_1, \ldots, D_m)$ of $\mathfrak{A}$, which satisfies the
condition (2.1.1), the algebra $\mathfrak{L}$ consisting of all derivations of the form $\sum f_i D_i$,
where $f_i \in \mathfrak{A}$, has a solvable ideal $\mathfrak{S}$ such that $\mathfrak{L}/\mathfrak{S}$ is isomorphic to a generalized
Witt algebra.


Similarly we may obtain the following


THEOREM 2.9. Suppose that the commutative associative algebra $\mathfrak{A}$ is completely
primary. Then for any set of derivations $(D_0, \ldots, D_m)$ of $\mathfrak{A}$, which satisfies the
condition (2.1.1), an algebra $\mathfrak{L}$ of the form $\mathfrak{L}(D_0, \ldots, D_m; a_0, \ldots, a_m)$ has a
solvable ideal $\mathfrak{S}$ such that $\mathfrak{L}/\mathfrak{S}$ is isomorphic either to a generalized Witt algebra
or to an algebra of the form $\mathfrak{L}(E_0, \ldots, E_r; b_0, \ldots, b_r)$, where $(E_0, \ldots, E_r)$ is an
o. system of derivations of the group algebra over $K$ of an abelian group of type
$(p, p, \ldots, p)$.


REFERENCES 


1. A. A. Albert and M. S. Frank, Simple Lie algebras of characteristic p, Rendiconti del Sem.
Mat., Univ. e Politech. di Torino, 14 (1955), 117-39.

2. M. S. Frank, A new class of simple Lie algebras, Proc. Nat. Acad. Sci., 40 (1954), 713-18.

3. N. Jacobson, Abstract derivation and Lie algebras, Trans. Amer. Math. Soc., 42 (1937),
206-24.

4. S. A. Jennings and Rimhak Ree, On a class of Lie algebras of characteristic p, Trans. Amer.
Math. Soc., 84 (1957), 192-207.

5. Rimhak Ree, On generalized Witt algebras, Trans. Amer. Math. Soc., 83 (1956), 510-46.


The University of British Columbia











SUPERSOLUBLE IMMERSION 
REINHOLD BAER 


Supersoluble immersion of a normal subgroup $K$ of a finite group $G$ shall
be defined by the following property:

If $\sigma$ is a homomorphism of $G$, and if the minimal normal subgroup $J$ of
$G^\sigma$ is part of $K^\sigma$, then $J$ is cyclic (of order a prime).

Our principal aim in the present investigation is the proof of the equivalence
of the following three properties of the normal subgroup $K$ of the finite
group $G$:

(i) $K$ is supersolubly immersed in $G$.

(ii) $K/\phi K$ is supersolubly immersed in $G/\phi K$.

(iii) If $\theta$ is the group of automorphisms induced in the $p$-subgroup $U$ of
$K$ by elements in the normalizer of $U$ in $G$, then $\theta'\theta^{p-1}$ is a $p$-subgroup of $\theta$.

Though most of our discussion is concerned with the proof of this theorem,
some of our concepts and results are of independent interest. In § 1 we investigate
groups $G$ such that $G'G^{p-1}$ is a $p$-group. In § 2 some new and useful
characterizations of supersoluble groups are obtained. In § 3 we substitute
for supersoluble immersion the concept of a supersoluble pair which consists
of a group $G$ and a group $\theta$ of automorphisms of $G$ meeting the following
requirement:

If $L$ is a $\theta$-admissible normal subgroup of $G$, then every minimal $\theta$-admissible
normal subgroup of $G/L$ is cyclic (of order a prime).

These supersoluble pairs are somewhat easier to handle than supersoluble
immersion, though their investigation is, for all practical purposes, equivalent
to that of supersoluble immersion.


Notations

$G'$ = commutator subgroup of $G$.

$ZG$ = centre of $G$.

$\phi G$ = Frattini subgroup of $G$ = intersection of all the maximal subgroups
of $G$.

$p$-elements and $p$-groups are elements and groups of order a power of the
prime $p$.

$G^k$ = subgroup of $G$, generated by all the $k$th powers of elements in $G$.

$G$ is a group of exponent $e$, if $G^e = 1$.

$G$ is $p$-closed, if products of $p$-elements are $p$-elements.

If $U$ is a subgroup of $G$, then $NU$ is the normalizer and $CU$ the centralizer
of $U$ in $G$.


Received October 14, 1958. 










$\theta$ is an irreducible group of automorphisms of the group $G$, if 1 and $G$ are
the only $\theta$-admissible subgroups of $G$.
All groups considered are finite.


0. We begin with a survey of the salient facts of the theory of finite super- 
soluble groups; for details cf. (1; 2, § 11) and (5). A group G is termed super- 
soluble, if every epimorphic image, not 1, of G possesses a cyclic normal 
subgroup different from 1. This implies the apparently stronger fact that 
the minimal normal subgroups of the epimorphic images of supersoluble 
groups are cyclic of order a prime. Subgroups, epimorphic images, and direct 
products of supersoluble groups are likewise supersoluble. Extensions of 
supersoluble groups by supersoluble groups are, in general, not supersoluble; 
but extensions of cyclic groups by supersoluble groups and central extensions 
of supersoluble groups by supersoluble groups are supersoluble. 


HUPPERT’S THEOREM: The following three properties of G are equivalent: G
is supersoluble; $G/\phi G$ is supersoluble; every maximal subgroup of G has index
a prime.
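As a small illustration of the third property in Huppert’s Theorem, the following Python sketch enumerates all subgroups of two small permutation groups by brute force and lists the indices of their maximal subgroups. The helper names (`closure`, `all_subgroups`, `maximal_subgroup_indices`) and the choice of the symmetric group on three points and the alternating group on four points as test cases are assumptions made for this example; they do not come from the paper.

```python
from itertools import permutations

def compose(a, b):
    """Return the permutation x -> a[b[x]] on the points 0..n-1."""
    return tuple(a[b[i]] for i in range(len(a)))

def closure(gens, n):
    """Subgroup generated by gens (in a finite group, products suffice)."""
    identity = tuple(range(n))
    elems = {identity}
    frontier = [identity]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = compose(x, g)
            if y not in elems:
                elems.add(y)
                frontier.append(y)
    return frozenset(elems)

def all_subgroups(group, n):
    """Every subgroup, reached by adjoining one generator at a time."""
    found = {closure([], n)}
    frontier = list(found)
    while frontier:
        h = frontier.pop()
        for g in group:
            if g not in h:
                bigger = closure(list(h) + [g], n)
                if bigger not in found:
                    found.add(bigger)
                    frontier.append(bigger)
    return found

def maximal_subgroup_indices(group, n):
    """Sorted list of the distinct indices of the maximal subgroups."""
    group = frozenset(group)
    proper = [h for h in all_subgroups(group, n) if h != group]
    maximal = [h for h in proper if not any(h < k for k in proper)]
    return sorted({len(group) // len(h) for h in maximal})

def is_even(perm):
    """Parity test by counting inversions."""
    inversions = sum(1 for i in range(len(perm))
                     for j in range(i + 1, len(perm)) if perm[i] > perm[j])
    return inversions % 2 == 0

S3 = set(permutations(range(3)))
A4 = {p for p in permutations(range(4)) if is_even(p)}

print(maximal_subgroup_indices(S3, 3))  # [2, 3]
print(maximal_subgroup_indices(A4, 4))  # [3, 4]
```

For the symmetric group on three points both indices are prime, in line with its supersolubility; the alternating group on four points has a maximal subgroup of index 4 and so is not supersoluble.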


If G is supersoluble, then its commutator subgroup G’ is nilpotent; and G 
has the 

Sylow Tower Property of supersoluble groups: If H is an epimorphic image 
of G and p is a maximal prime divisor of the order of H, then the totality P
of p-elements in H is a characteristic p-subgroup of H; in other words: H is 
p-closed. 


1. In this section we are going to discuss a very special class of supersoluble 
groups which, however, will prove important in the sequel. 

We recall that it is customary to term exponent of a group G the l.c.m. of
the orders of the elements in G. For our purpose it will be more convenient
to say that G is a group of exponent e whenever $G^e = 1$, that is, whenever e is
some common multiple of the orders of the elements in G. If p is a prime,
then the group G is termed p-closed, whenever products of elements of order
a power of p are again elements of order a power of p. This is equivalent
to requiring the existence of one and only one p-Sylow subgroup which is
then a characteristic p-subgroup of index prime to p; and this characteristic
p-subgroup of G shall be termed the p-component of G.
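The convention that any common multiple of the element orders counts as an exponent can be checked on a cyclic group. The sketch below works in the additive group of integers modulo n, where the condition $G^e = 1$ reads "e times x is 0 for every x"; the function names are hypothetical and chosen only for this illustration.

```python
from math import gcd

def element_orders(n):
    """Orders of the elements 0..n-1 of the additive cyclic group Z_n."""
    return [n // gcd(n, x) for x in range(n)]

def is_exponent(n, e):
    """True if e*x = 0 in Z_n for every x (G^e = 1 in multiplicative notation)."""
    return all((e * x) % n == 0 for x in range(n))

print(sorted(set(element_orders(6))))  # [1, 2, 3, 6]
print(is_exponent(6, 6), is_exponent(6, 12), is_exponent(6, 3))  # True True False
```

Under this convention the cyclic group of order 6 is a group of exponent 6 and also of exponent 12, but not of exponent 3.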


Definition. If the group G is p-closed, and if G/P is abelian of exponent
$p-1$, where P is the p-component of G, then G is strictly p-closed.

If G is strictly p-closed, then its commutator subgroup $G'$ and the subgroup
$G^{p-1}$ generated by the $(p-1)$th powers of elements in G are characteristic
p-subgroups. If conversely $G'$ and $G^{p-1}$ are p-subgroups, then $G'G^{p-1}$
is a characteristic p-subgroup of G such that $G/G'G^{p-1}$ is abelian of exponent
$p-1$. Hence G is strictly p-closed if, and only if, $G'$ and $G^{p-1}$ are p-subgroups
of G.
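The criterion just stated (G is strictly p-closed exactly when the commutator subgroup and the subgroup generated by the (p-1)th powers are both p-groups) lends itself to a direct computation on a small example. The following Python sketch, with helper names and the test group chosen as assumptions for illustration only, applies it to the symmetric group on three points.

```python
from itertools import permutations

def compose(a, b):
    """Return the permutation x -> a[b[x]] on the points 0..n-1."""
    return tuple(a[b[i]] for i in range(len(a)))

def inverse(a):
    inv = [0] * len(a)
    for i, x in enumerate(a):
        inv[x] = i
    return tuple(inv)

def power(a, k):
    result = tuple(range(len(a)))
    for _ in range(k):
        result = compose(result, a)
    return result

def generated_subgroup(gens, n):
    """Closure of gens under multiplication (enough in a finite group)."""
    identity = tuple(range(n))
    elems = {identity}
    frontier = [identity]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = compose(x, g)
            if y not in elems:
                elems.add(y)
                frontier.append(y)
    return elems

def is_power_of(size, p):
    """True if size is a power of p (the trivial order 1 included)."""
    while size % p == 0:
        size //= p
    return size == 1

def is_strictly_p_closed(group, p, n):
    """Test whether G' and G^(p-1) are both p-groups."""
    commutators = {compose(compose(inverse(a), inverse(b)), compose(a, b))
                   for a in group for b in group}
    derived = generated_subgroup(commutators, n)
    powers = generated_subgroup({power(g, p - 1) for g in group}, n)
    return is_power_of(len(derived), p) and is_power_of(len(powers), p)

S3 = set(permutations(range(3)))
print(is_strictly_p_closed(S3, 3, 3))  # True
print(is_strictly_p_closed(S3, 2, 3))  # False
```

For p = 3 both subgroups equal the subgroup of order 3, so the group is strictly 3-closed; for p = 2 the commutator subgroup has order 3, so the test fails.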



















SUPERSOLUBLE IMMERSION 355 


It is easy to verify that subgroups, epimorphic images, and direct products 
of strictly p-closed groups are again strictly p-closed. Likewise, extensions 
of p-groups by strictly p-closed groups are strictly p-closed. 

Consider a strictly p-closed group G. Suppose that M is a minimal normal
subgroup of an epimorphic image H of G. Then H is likewise strictly p-closed.
Hence $H'H^{p-1}$ is a characteristic p-subgroup of H and at the same time the
p-Sylow subgroup of H. If the order of M is prime to p, then

$$[M, H'H^{p-1}] \le M \cap H'H^{p-1} = 1;$$

and if the order of M is divisible by p, then M is part of the p-component
$H'H^{p-1}$ of H. Application of a well-known property of p-groups shows that
in this case $M \cap Z(H'H^{p-1}) \ne 1$; and this implies $M \le Z(H'H^{p-1})$ because
of the minimality of M. Thus we have shown again that $[M, H'H^{p-1}] = 1$.
Hence condition (viii) of (2, p. 184, Theorem 1) is satisfied by G; and this
implies that strictly p-closed groups are supersoluble. This important fact has
various consequences.


THEOREM 1.1. G is a cyclic group of order p if, and only if, G is a p-group, 
not 1, possessing an irreducible and strictly p-closed group of automorphisms. 


Proof. If G is cyclic of order p, then its group of automorphisms is cyclic
of order $p-1$, proving the necessity of our condition. If conversely, $G \ne 1$
is a p-group and $\theta$ is an irreducible and strictly p-closed group of automorphisms
of G, then we recall that G is a normal subgroup of its own holomorph
and that we may form consequently the subgroup $G\theta$ of the holomorph of
G. Since $\theta$ is irreducible, G is a minimal normal subgroup of $G\theta$. Since G is a
p-group and $\theta$ is strictly p-closed, $G\theta$ is likewise strictly p-closed. Hence $G\theta$
is in particular supersoluble; and this implies that its minimal normal subgroup
G is cyclic. Thus G is of order p.
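The first step of this proof uses the standard fact that the automorphisms of a cyclic group of order p are the multiplications by the nonzero residues modulo p, and that they form a cyclic group of order p-1. A minimal Python sketch, with the function name and the sample prime 7 as assumptions for illustration, confirms this by computing the order of each multiplier.

```python
def automorphism_orders(p):
    """Order of each map x -> a*x (mod p), a = 1..p-1, acting on Z_p."""
    orders = {}
    for a in range(1, p):
        k, x = 1, a
        while x != 1:
            x = (x * a) % p
            k += 1
        orders[a] = k
    return orders

orders = automorphism_orders(7)
print(orders)  # {1: 1, 2: 3, 3: 6, 4: 3, 5: 6, 6: 2}
# Some multiplier has order p - 1, so the group of automorphisms is cyclic,
# and every order divides p - 1, so the group has exponent p - 1.
print(max(orders.values()) == 6, all(6 % k == 0 for k in orders.values()))
```

Since this group is abelian of exponent p-1 and of order prime to p, it is strictly p-closed in the sense of the Definition above.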

For the convenience of the reader we insert here some well-known facts 
concerning automorphisms of p-groups. 


LEMMA. The automorphism $\sigma$ of the p-group G is of order a power of p, if it
satisfies one of the following conditions:

(a) $\sigma$ induces the identity automorphism in $G/\phi G$ or

(b) there exists a $\sigma$-admissible normal chain of G in whose factors the identity
automorphism is induced by $\sigma$.


Proof. The sufficiency of condition (a) is contained in a result due to P.
Hall (4, p. 38). Assume next the existence of $\sigma$-admissible subgroups U(i)
of G such that

1 = U(0), U(i) is a normal subgroup of U(i + 1), U(k) = G,
$\sigma$ induces the identity automorphism in every U(i + 1)/U(i).

Then $\sigma$ certainly induces a p-automorphism in U(0). We may therefore make
the inductive hypothesis that $\sigma$ induces a p-automorphism in U(i) for some
i < k. There exists consequently a positive integer m such that $\sigma^{p^m}$ induces
the identity automorphism in U(i). Hence $\sigma^{p^m}$ induces the identity automorphism
both in U(i) and in U(i + 1)/U(i). It is well known (and may be
verified by a simple computation) that $\sigma^{p^m}$ induces a p-automorphism in U(i + 1).
Thus we have shown that $\sigma$ induces a p-automorphism in U(i + 1), completing
our inductive argument. Hence $\sigma$ is a p-automorphism of U(k) = G.


THEOREM 1.2. A group G with ZG = 1 is strictly p-closed if, and only if, 
maximal subgroups of G are either normal or else have index p in G. 


Proof. If G is strictly p-closed, then $G'G^{p-1}$ is a p-subgroup of G. If the
maximal subgroup S of G is not normal, then in particular $G'$ is not part of
S. Hence $G = SG'$. Since $G'$ is a p-subgroup, this implies that [G:S] is a power
of p. But strictly p-closed groups are supersoluble; and the maximal subgroups
of supersoluble groups have index a prime. Hence [G:S] = p, proving
the necessity of our condition.

Assume conversely the validity of our condition. Denote by P some p-Sylow
subgroup of G and by NP its normalizer in G. If P were not normal, then
$NP \ne G$ and there would exist a maximal subgroup S of G containing NP.
From $P \le NP \le S$ we conclude that [G:S] is prime to p; and this implies
by hypothesis that S is a normal subgroup of G. Consequently S contains
every p-Sylow subgroup of G as its own p-Sylow subgroup so that p-Sylow
subgroups of G are conjugate in S. Application of the Frattini argument
shows that $G = S \cdot NP = S < G$, a contradiction proving the normality of
P and the p-closure of G. Since every maximal subgroup of G which contains
P has index prime to p, these maximal subgroups are, by hypothesis, normal.
Consequently every maximal subgroup of G/P is normal; and this implies
by Wielandt’s Theorem the nilpotency of G/P; see (7, p. 108, Satz 13). Application
of Schur’s Theorem shows the existence of a complement D to P in
G, since [G:P] is prime to p (7, p. 125, Satz 25). Since $G/P \cong D$, this subgroup
D of G is nilpotent too. Every maximal subgroup of G has, by hypothesis,
a prime index. Application of Huppert’s Theorem shows the supersolubility
of G; see (5, p. 416, Satz 9) or (2, p. 184, Theorem 1). Consider next normal
subgroups A and B of G satisfying $A < B \le P$ and [B:A] = p. Since G induces
in the cyclic group B/A of order p a cyclic group of automorphisms whose
order is a divisor of $p-1$, it follows in particular that $[B, D'D^{p-1}] \le A$. Since
G has been shown to be supersoluble, there exist normal subgroups A(i) of G
such that

$$1 = A(0), \quad A(i) < A(i+1), \quad A(k) = P, \quad [A(i+1):A(i)] = p.$$

From what we have shown just now it follows that $[A(i+1), D'D^{p-1}] \le A(i)$
for every i.

In other words: every element in $D'D^{p-1}$ induces an automorphism in P
which in turn induces the identity automorphism in every A(i + 1)/A(i).
By Lemma (b) such an automorphism has order a power of p. Consequently,
the order of D being prime to p,
every element in $D'D^{p-1}$ induces the identity automorphism in P. If $D'D^{p-1}$
were not 1, then we would deduce $D'D^{p-1} \cap ZD \ne 1$ from the nilpotency of
D. Elements in $D'D^{p-1} \cap ZD$ commute with every element in P and every
element in D; and they belong consequently to the centre of PD = G. But
ZG = 1 by hypothesis and hence $D'D^{p-1} \cap ZD = 1$, a contradiction which
proves that $D'D^{p-1} = 1$. It follows from $G/P \cong D$ that $G'G^{p-1} \le P$; and this
completes the proof of the strict p-closure of G.


Remark. Note that ZG = 1 was not needed for the proof of the necessity 
of our condition. The example of suitably selected nilpotent groups shows 
that ZG = 1 is indispensable for the proof of the sufficiency of our condition. 


THEOREM 1.3. A group G is strictly p-closed if, and only if, 

(a) elements in G do not induce automorphisms of order p in subgroups of 
order prime to p and 

(b) subgroups of order prime to p are abelian of exponent p — 1. 


Proof. Assume first the existence of a normal p-subgroup P of G such
that G/P is abelian of exponent $p-1$. Consider a subgroup U of order prime
to p. Then $P \cap U = 1$ so that U is isomorphic to the subgroup PU/P of
G/P. Since the latter group is abelian of exponent $p-1$, so is U. If furthermore
the element g in G induces in U an automorphism of order a power of
p, then we may assume without loss of generality that g is a p-element. As
such g belongs to P and the commutators [g, u] for u in U belong to $P \cap U = 1$.
Hence g commutes with every element in U and so induces the identity
automorphism in U. This proves the necessity of (a) and (b).

If strict p-closure were not a consequence of (a) and (b), then there would 
exist a group G of minimal order satisfying (a), (b), without being strictly 
p-closed. Every subgroup of G meets requirements (a) and (b). Because of 
the minimality of G it follows that 

(1) every proper subgroup of G is strictly p-closed. 

Since G is not strictly p-closed, it is certainly not a p-group. Consequently
there exists a prime $q \ne p$ dividing the order of G. Denote by Q a q-Sylow
subgroup of G. By (b), $Q \ne 1$ is abelian of exponent $p-1$. If g is an element
in the normalizer NQ of Q, then g is the product $g = g'g''$ of an element $g'$
of order prime to p and an element $g''$ of order a power of p both of which
belong to NQ. Since $Q\{g'\}$ is of order prime to p, it is by (b) abelian so that
$g'$ belongs to the centralizer of Q. It is a consequence of (a) that $g''$ belongs
to the centralizer of Q. Thus NQ is the centralizer of the q-Sylow subgroup
Q. Hence we may apply Burnside’s Theorem asserting the existence of a
normal subgroup T of G complementary to Q (7, p. 133, Satz 4). T is a
proper subgroup of G, since $Q \ne 1$. Hence T is, by (1), strictly p-closed.
Consequently the totality P of p-elements in T is a characteristic p-subgroup
of T whose index [T:P] is prime to p. Since P is a characteristic subgroup of
the normal subgroup T, P is a normal subgroup of G. Since [G:T] is a power
of q, namely the order of Q, the index [G:P] is prime to p. Application of
Schur’s Theorem shows the existence of a complement C of P in G. Since
$C \cong G/P$ is of order prime to p, it is by (b) abelian of exponent $p-1$. Hence
G is strictly p-closed, a contradiction proving our theorem.


2. In this section we derive a number of properties of supersoluble groups. 
Some of them are of independent interest and all of them will be needed in 
the sequel. 


THEOREM 2.1. The following properties of the group G are equivalent. 
(i) G is supersoluble. 
(ii) NU/CU is, for every p-subgroup U of G, strictly p-closed. 
(iii) The Sylow Tower Property of supersoluble groups is satisfied by G; and 
NP/CP is, for every p-Sylow subgroup P of G, strictly p-closed. 


Proof. Assume first the supersolubility of G; and consider a p-subgroup U
of G. The group $\theta$ of automorphisms, induced in U by elements in NU, is
essentially the same as NU/CU. Since G is supersoluble, so is its subgroup
NU. Since U is a normal p-subgroup of the supersoluble group NU, there
exist normal subgroups U(i) of NU such that

$$1 = U(0), \quad U(i) < U(i+1), \quad [U(i+1):U(i)] = p, \quad U(k) = U.$$

Normal subgroups of NU which are part of U are $\theta$-admissible. Thus every
U(i) is $\theta$-admissible. Denote by $\theta^*$ the totality of those automorphisms in $\theta$
which induce the identity automorphism in every U(i + 1)/U(i). Clearly
$\theta^*$ is a normal subgroup of $\theta$. An immediate application of § 1, Lemma (b)
shows that every automorphism in $\theta^*$ is a p-automorphism. Thus $\theta^*$ is a
normal p-subgroup of $\theta$. Since every U(i + 1)/U(i) is cyclic of order p, its
group of automorphisms is cyclic of order $p-1$. The automorphisms in
$\theta^{p-1}\theta'$ induce consequently the identity automorphism in every U(i + 1)/U(i).
Hence $\theta^{p-1}\theta' \le \theta^*$. The isomorphic groups $\theta$ and NU/CU are therefore strictly
p-closed, proving that (ii) is a consequence of (i).

Assume next the validity of (ii). Consider a subgroup S of G and a minimal
prime divisor p of the order of S. If U is a p-subgroup of S, then NU/CU is,
by (ii), strictly p-closed. It follows that $[NU \cap S]/[CU \cap S]$ is likewise
strictly p-closed. But p is a minimal prime divisor of the order of S. Hence
$[NU \cap S]/[CU \cap S]$ is a p-group. Thus p-automorphisms only are induced
in U by elements in S. Consequently we may apply a result that we derived
elsewhere assuring the validity of the Sylow Tower Property of supersoluble
groups in G (3, Theorem 6.2). It follows that (iii) is a consequence of (ii).

Assume finally the validity of (iii). If K is a normal subgroup of G and p
a maximal prime divisor of [G:K], then the totality $P^*$ of p-elements in $G^* = G/K$
is a characteristic p-subgroup of index prime to p. If P is a p-Sylow subgroup
of G, then $P^* = KP/K$; and we note that KP is a normal subgroup of G,
since $P^*$ is a characteristic subgroup of $G^*$. Application of the Frattini
argument shows therefore $G = (KP)NP = K \cdot NP$; and now one sees without
difficulty that the group of automorphisms induced in $P^*$ by elements in $G^*$
is an epimorphic image of the group of automorphisms induced in P by
elements in NP. The latter group is essentially the same as NP/CP. Since P
is a p-Sylow subgroup of G, we deduce strict p-closure of NP/CP from (iii).
Consequently a strictly p-closed group $\theta$ of automorphisms is induced in $P^*$ by
elements in $G^*$. There exists a minimal normal subgroup $M^*$ of $G^*$ which is
part of $P^*$. The group $\theta^*$ of automorphisms which are induced in $M^*$ by
elements in $G^*$ is an epimorphic image of $\theta$. Hence $\theta^*$ is strictly p-closed and,
because of the minimality of $M^*$, irreducible. Application of Theorem 1.1
shows that $M^*$ is cyclic of order p. Hence G is supersoluble so that (i) is a
consequence of (iii), q.e.d.


Remark 2.1. It is impossible to omit the first half of condition (iii) as may
be seen from the following example. Assume that p and q are primes and
that q is a divisor of $p-1$. Then there exists a group A of order pq which
possesses a normal subgroup B of order p such that the elements in A induce
in B a group of automorphisms of order q. Clearly A is supersoluble, but
not cyclic. Next denote by K an elementary abelian q-group of order $q^{pq}$ and
let G be an extension of K by A such that A acts as a regular permutation
group on a basis of K (we may choose G as a splitting extension of K by A).
The group of automorphisms induced in K by elements in G is isomorphic
to A and hence not strictly q-closed. This implies in particular that G, though
soluble, is not supersoluble (Theorem 2.1). A q-Sylow subgroup of G is an
extension of K by a cyclic group of order q. Since A is an extension of a p-group
by a q-group and not cyclic, one sees that q-Sylow subgroups of G are their
own normalizers. Hence NQ/CQ = Q/ZQ is, for every q-Sylow subgroup Q
of G, a q-group. The p-Sylow subgroups of G are cyclic of order p. Their
normalizers may contain elements in K; but these would belong to their
centralizers. It follows that NP/CP is cyclic of order q for every p-Sylow
subgroup P of G; and such a group is strictly p-closed, since q is a divisor
of $p-1$. Thus G is soluble, but not supersoluble; and G satisfies the second
half of condition (iii), but not the Sylow Tower Property of supersoluble
groups.


Remark 2.2. Using results derived by us elsewhere (3, Theorem 6.2) one 
shows the equivalence of the three conditions of Theorem 2.1 with the follow- 
ing property: 

If p is a minimal prime divisor of the order of the subgroup S of G, then S is 
completely p-normal; and NP/CP is, for every p-Sylow subgroup P of G, strictly 
p-closed. 


THEOREM 2.2. Assume that P is a p-Sylow subgroup of a supersoluble group G.
(a) $(NP)'(NP)^{p-1}$ is the direct product of a p-group and a group of order
prime to p; and G = NP in case p is the maximal prime divisor of the order
of G.
(b) $P \cap \phi G \le \phi P$; and $P \cap \phi G = \phi P$ in case p is the maximal prime divisor
of the order of G.


Proof. We note first that NP/CP is essentially the same as the group of
automorphisms induced in P by elements in NP. It is a consequence of
Theorem 2.1 that this group of automorphisms is strictly p-closed. Consequently
$(CP)(NP)'(NP)^{p-1}/CP$ is a p-group. Since the p-Sylow subgroup P
of G is a normal subgroup of NP, this implies

$$(NP)'(NP)^{p-1} \le P \cdot CP.$$

Since P is also a normal subgroup of $P \cdot CP$ whose index is prime to p, it
follows that $P \cap CP = ZP$ is a normal p-subgroup of CP whose index in
CP is prime to p. By Schur’s Theorem there exists a complement Q of $P \cap CP$
in CP; see, for instance (7, p. 125, Satz 25). Hence

$$P \cdot CP = P(P \cap CP)Q = PQ.$$

Since Q is part of the centralizer of P, $P \cdot CP$ is the direct product of P and
Q. Since the elements in Q are of order prime to p, it follows that $P \cdot CP$ is
the direct product of a p-group and a group of order prime to p. But this
property is subgroup inherited. Hence $(NP)'(NP)^{p-1}$ is the direct product
of a p-group and a group of order prime to p. That G = NP in case p is the
maximal prime divisor of the order of G, is a consequence of Theorem 2.1 (the
Sylow Tower Property of supersoluble groups).

Denote by A the set of all those elements in G whose orders are divisible
by primes greater than p only. Because of the Sylow Tower Property of
supersoluble groups A is a characteristic subgroup of G whose order is divisible
by primes greater than p only. The product AP is likewise a characteristic
subgroup of G. It consists of just those elements in G whose orders are not
divisible by primes smaller than p. By Schur’s Theorem or by P. Hall’s
characteristic property for soluble groups there exists a complement B of A
in G, since o(A) and [G:A] are relatively prime. Since B is isomorphic to
G/A, the complement B contains a p-Sylow subgroup of G; and since any
two p-Sylow subgroups of G are conjugate in G, we may assume without
loss in generality that $P \le B$. Since AP is a characteristic subgroup of G,
$P = AP \cap B$ is a normal subgroup of B. The characteristic subgroup $\phi P$
of P is consequently a normal subgroup of B. Let $B^* = B/\phi P$ and $P^* = P/\phi P$.
Then $P^*$ is an elementary abelian p-group, the p-Sylow subgroup of $B^*$ and
characteristic in $B^*$. Since $B^*$ is supersoluble, the elements in $B^*$ induce in
$P^*$ a strictly p-closed group $\theta$ of automorphisms. This group $\theta$ is essentially
the same as $B^*/CP^*$. Since $P^*$ is abelian, $P^* \le CP^*$ so that $[B^*:CP^*]$ is
prime to p. Hence $\theta$ is a strictly p-closed group of order prime to p; in other
words $\theta$ is abelian of exponent $p-1$. Since the group $\theta$ of order prime to p
acts on the elementary abelian p-group $P^*$, it is completely reducible (Maschke’s
Theorem) (6, p. 81, Theorem 46). This signifies that every $\theta$-admissible
subgroup of $P^*$ possesses in $P^*$ a $\theta$-admissible complement. Because of the
supersolubility of $B^*$ minimal normal subgroups of $B^*$ which are contained
in $P^*$ have order p. It follows that $P^*$ is the direct product of $\theta$-admissible
cyclic groups of order p. This in turn implies that the intersection of all
maximal $\theta$-admissible subgroups of $P^*$ is equal to 1. Consider now some
maximal $\theta$-admissible subgroup M of $P^*$. Then M has index p in $P^*$ and is a
normal subgroup of $B^*$. There exists, by Schur’s Theorem, a complement D
of $P^*$ in $B^*$. It is clear that MD is a maximal subgroup of $B^*$. It follows
now that

$$P^* \cap \phi B^* = 1.$$

Every maximal subgroup of $B^*$ has the form $S/\phi P$ with S a maximal subgroup
of B. If J is the intersection of all these maximal subgroups S, then
we deduce $P \cap J = \phi P$ from $P^* \cap \phi B^* = 1$. If S is a maximal subgroup
of B, then AS is a maximal subgroup of G, since B is a complement of the
characteristic subgroup A of G. From $P \cap J = \phi P$ we deduce now that
$P \cap \phi G \le \phi P$.

Suppose finally in particular that p is the maximal prime divisor of the
order of G. Then P is a characteristic subgroup of G (Sylow Tower Property
of supersoluble groups). Consequently $\phi P$ is a characteristic subgroup of G
too. We recall that maximal subgroups of supersoluble groups have index
a prime. If the maximal subgroup S of G does not contain P, then consequently
[G:S] = p. From G = PS we deduce now that $[P:P \cap S] = p$, since
P is a characteristic subgroup of G. It follows that $\phi P \le P \cap S \le S$. Thus
we have shown $\phi P \le \phi G$. Hence

$$P \cap \phi G \le \phi P \le P \cap \phi G,$$

proving $\phi P = P \cap \phi G$ in case p is the maximal prime divisor of the order
of G.


Remark. Consider primes p, q such that $q^2$ is a divisor of $p-1$. Then there
exists an extension G of a cyclic group P of order p by a cyclic group of
order $q^2$ such that the elements in G induce in P a cyclic group of automorphisms
whose order is $q^2$. Every q-Sylow subgroup of G is cyclic of order $q^2$
and a maximal subgroup of G; and there exist two q-Sylow subgroups of G
with intersection 1. Hence $\phi G = 1$. If Q is a q-Sylow subgroup of G, then
$\phi Q$ is cyclic of order q. Hence

$$Q \cap \phi G = 1 < \phi Q$$

showing the impossibility of improving (b).
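The smallest instance of this Remark can be examined by machine. The Python sketch below takes p = 5 and q = 2 (so that 4 divides 5 - 1) and realizes G as the group of affine maps x -> a*x + b modulo 5 with a nonzero; this concrete model, the helper names, and the brute-force computation of the Frattini subgroup as the intersection of all maximal subgroups are assumptions made for illustration only.

```python
P = 5  # assumed prime with q**2 = 4 dividing P - 1 (q = 2)
IDENTITY = (1, 0)

def mul(f, g):
    """Compose affine maps x -> a*x + b on Z_P; f is applied after g."""
    (a, b), (c, d) = f, g
    return ((a * c) % P, (a * d + b) % P)

def closure(gens):
    """Subgroup generated by gens (products suffice in a finite group)."""
    elems = {IDENTITY}
    frontier = [IDENTITY]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = mul(x, g)
            if y not in elems:
                elems.add(y)
                frontier.append(y)
    return frozenset(elems)

def all_subgroups(group):
    """Every subgroup, reached by adjoining one generator at a time."""
    found = {closure([])}
    frontier = list(found)
    while frontier:
        h = frontier.pop()
        for g in group:
            if g not in h:
                bigger = closure(list(h) + [g])
                if bigger not in found:
                    found.add(bigger)
                    frontier.append(bigger)
    return found

def frattini(group):
    """Intersection of all maximal subgroups of the given group."""
    group = frozenset(group)
    proper = [h for h in all_subgroups(group) if h != group]
    maximal = [h for h in proper if not any(h < k for k in proper)]
    result = group
    for h in maximal:
        result &= h
    return result

G = frozenset((a, b) for a in range(1, P) for b in range(P))  # order 20
Q = frozenset((a, 0) for a in range(1, P))  # a 2-Sylow subgroup, cyclic of order 4

print(sorted(frattini(G)))  # [(1, 0)]
print(sorted(frattini(Q)))  # [(1, 0), (4, 0)]
```

In this model the Frattini subgroup of G is trivial while that of Q has order 2, matching the strict inequality displayed in the Remark.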


3. We are now ready to turn to the study of supersoluble immersion.


Definition. A normal subgroup K of G is supersolubly immersed in G if to
every homomorphism $\sigma$ of G with $K^\sigma \ne 1$ there exists a cyclic normal subgroup
$A \ne 1$ of $G^\sigma$ such that $A \le K^\sigma$.













This implies the apparently stronger property mentioned in the introduction:
If K is a supersolubly immersed normal subgroup of G, if $\sigma$ is a
homomorphism of G, and if a minimal normal subgroup M of $G^\sigma$ is part of
$K^\sigma$, then M is cyclic of order a prime.

We note that S is a supersoluble subgroup of G if S contains the super- 
solubly immersed normal subgroup K of G, and if S/K is supersoluble. This 
important property has two interesting consequences: 

(a) The product of all supersolubly immersed normal subgroups of G is a 
supersolubly immersed characteristic subgroup of G. 

(b) Every maximal supersoluble subgroup of G contains every supersolubly 
immersed normal subgroup of G. 

Examples show, however, that the intersection of all maximal supersoluble 
subgroups of G may actually be greater than the product of all supersolubly 
immersed normal subgroups of G. 

Much use will be made of the following theorem: If the normal subgroup
K of G is supersolubly immersed in G, then the elements in G induce in K a
supersoluble group of automorphisms (5, p. 420, Satz 12).

This leads us to the following companion concept. 


Definition. A group G and a group $\theta$ of automorphisms form a supersoluble
pair if the minimal $\theta$-admissible normal subgroups of G/T, for T a $\theta$-admissible
normal subgroup of G, are cyclic (of order a prime).


If, for instance, T is a supersolubly immersed normal subgroup of the
group G and $\Gamma$ is the group of automorphisms, induced in T by elements
in G, then T, $\Gamma$ is clearly a supersoluble pair. In a way the converse is true
too. For consider a supersoluble pair G, $\theta$. Then let H be the holomorph
of G. This contains G as a normal subgroup and it also contains $\theta$. Their
product $G\theta$ is an extension of G by $\theta$ which realizes in G in an obvious way the
automorphism group $\theta$. We shall refer to this splitting extension of G by $\theta$
as the product of the group G and its group $\theta$ of automorphisms. Since G, $\theta$
is a supersoluble pair, it is quite obvious that G is supersolubly immersed in
$G\theta$. It follows in particular that the group $\Gamma$ of automorphisms induced in G
by $G\theta$ is supersoluble. Since $\theta \le \Gamma$, we have shown the supersolubility of $\theta$.
Incidentally we have shown that G, $\Gamma$ is a supersoluble pair where the group
$\Gamma$ of automorphisms of G is the compositum of $\theta$ and the group of inner
automorphisms of G.

We may summarize the principal results of the preceding discussion as 
follows: 

The following properties of the normal subgroup T of the group G are equivalent.

(i) T is supersolubly immersed in G.
(ii) A supersoluble pair is formed by T and the group $\Gamma$ of automorphisms
induced in T by G.
(iii) $T\Gamma$ is supersoluble.
















We mention finally the following easily verified inheritance properties: If
G, $\theta$ is a supersoluble pair, and if U is a $\theta$-admissible subgroup of G, then (if
we denote the group of automorphisms induced in U by $\theta$ likewise by $\theta$) U, $\theta$
is a supersoluble pair. If J is a $\theta$-admissible normal subgroup of G, then (if
we denote the group of automorphisms induced in G/J by $\theta$ likewise by $\theta$)
G/J, $\theta$ is a supersoluble pair. If, furthermore, $\theta$ is a group of automorphisms
of the group G, if L is a cyclic $\theta$-admissible normal subgroup, and if G/L, $\theta$
is a supersoluble pair, then G, $\theta$ is a supersoluble pair.

If X is a subgroup of G, it will be convenient to denote by $\theta_X$ the totality
(subgroup) of X-preserving automorphisms in the group $\theta$.


THEOREM 3.1. If a group $\theta$ of automorphisms of the group G contains all the
inner automorphisms of G, then the following properties of the pair G, $\theta$ are
equivalent:

(i) G, $\theta$ is a supersoluble pair.

(ii) If U is a p-subgroup of G, then $\theta_U$ induces a strictly p-closed group of
automorphisms in U.

(iii) G has the Sylow Tower Property of supersoluble groups; and if P is a
p-Sylow subgroup of G, then $\theta_P$ induces a strictly p-closed group of automorphisms
in P.

(iv) $\theta$ has the Sylow Tower Property of supersoluble groups; and if $\Sigma$ is a
subgroup of $\theta$ and P a $\Sigma$-admissible p-Sylow subgroup of G, then maximal
$\Sigma$-admissible subgroups of P have index p in P.


Proof. If G, $\theta$ is a supersoluble pair, then their product $G\theta$ is a supersoluble
group. If U is a p-subgroup of G, then the normalizer of U in $G\theta$ induces in U
a strictly p-closed group $\Gamma$ of automorphisms (Theorem 2.1). Since $\theta_U$ is by
definition part of the normalizer of U in $G\theta$, it follows that $\theta_U$ induces in U
a subgroup of $\Gamma$. Since the latter is strictly p-closed, so is its subgroup induced
by $\theta_U$. Hence (ii) is a consequence of (i).

If (ii) is satisfied by the pair G, $\theta$ then the second part of (iii) is, as a special
case of (ii), likewise satisfied. If U is a p-subgroup of G, then the group $\Gamma$ of
automorphisms of U which are induced in U by elements in the normalizer
NU of U in G is a subgroup of the group of automorphisms induced in U by
automorphisms in $\theta_U$. The latter group of automorphisms of U is strictly
p-closed by (ii). Hence $\Gamma$ is strictly p-closed too. Thus the condition (ii) of
Theorem 2.1 is satisfied by G, proving the supersolubility of G. Thus we
have shown that (iii) is a consequence of (ii).

Assume next that (iii) is satisfied by the pair G, $\theta$. If K is a $\theta$-admissible
normal subgroup of G, then $\theta$ induces in G/K a group $\Sigma$ of automorphisms.
Denote by S/K a p-Sylow subgroup of G/K. Then the group of S/K-preserving
automorphisms in $\Sigma$ may be denoted by $\Sigma_S$, since it is induced by the automorphisms
in $\theta_S$. Denote by P a p-Sylow subgroup of S. Since S/K is a
p-Sylow subgroup of G/K, we have S = KP and P is a p-Sylow subgroup of
G. Application of (iii) shows that $\theta_P$ induces in P a strictly p-closed group
of automorphisms. If $\sigma$ is an automorphism in $\theta_S$, then $KP = S = S^\sigma = KP^\sigma$,
since K is $\theta$-admissible. Hence $P^\sigma$ is a p-Sylow subgroup of KP. Consequently
there exist elements a and b in K and P respectively such that $P^\sigma = P^{ba} = P^a$.
Since $\theta$ contains every inner automorphism, the automorphism $\sigma a^{-1}$ belongs
to $\theta_P$. Since a belongs to K, it induces the identity automorphism in G/K.
Thus we have shown that the group of automorphisms induced in S/K by
elements in $\theta_S$ is an epimorphic image of the group of automorphisms induced
by $\theta_P$ in P. Since the latter is strictly p-closed, so is the former. Hence $\Sigma_S$
induces a strictly p-closed group of automorphisms in S/K. Since the Sylow
Tower Property of supersoluble groups is inherited by quotient groups, we
have shown that (iii) implies the following property:

(iii*) If K is a $\theta$-admissible normal subgroup of G, then the Sylow Tower
Property of supersoluble groups is satisfied by G/K = H; and if $\Sigma$ is the group
of automorphisms induced in H by $\theta$, and P is a p-Sylow subgroup of H, then
$\Sigma_P$ induces a strictly p-closed group of automorphisms in P.

It is not difficult to derive (i) from (iii*). For consider a $\theta$-admissible normal
subgroup K of G. If p is the maximal prime divisor of the order of H = G/K,
then the p-Sylow subgroup P of H is a characteristic p-subgroup of H because
of the Sylow Tower Property. Clearly $P \ne 1$; and as a characteristic subgroup
of H, P is $\Sigma$-admissible, if we denote by $\Sigma$ the group of automorphisms
induced in H by $\theta$. There exists a minimal $\Sigma$-admissible subgroup M of H
which is part of P. Since $\theta$, and hence $\Sigma$, contains every inner automorphism
of G and H respectively, M is a normal subgroup of H. Because of (iii*) a
strictly p-closed group $\Gamma$ of automorphisms is induced by $\Sigma$ in P; and the
automorphisms in $\Gamma$ induce a group $\Gamma^*$ of automorphisms in the $\Sigma$-admissible
subgroup M which is strictly p-closed as an epimorphic image of $\Gamma$. Since M
is a minimal $\Sigma$-admissible subgroup of H, $\Gamma^*$ is irreducible. Since M is a
p-group, we may apply Theorem 1.1 to see that M is cyclic of order p. Thus
we have established the existence of a cyclic normal $\Sigma$-admissible subgroup
$M \ne 1$ of H; and this shows that G, $\theta$ is a supersoluble pair. This completes
the proof of the equivalence of conditions (i) to (iii).

If G, $\theta$ is a supersoluble pair, then G and $\theta$ are both supersoluble groups, as
has been mentioned before, so that both G and $\theta$ have the Sylow Tower
Property of supersoluble groups. Consider now a subgroup $\Sigma$ of $\theta$ and a
$\Sigma$-admissible p-subgroup U of G. Then U, $\Sigma$ is a supersoluble pair too so
that their product $U\Sigma$ is a supersoluble group. Thus every maximal subgroup
of $U\Sigma$ has index a prime. If V is a maximal $\Sigma$-admissible subgroup of U, then
$V\Sigma$ is a maximal subgroup of $U\Sigma$. Hence $[U\Sigma:V\Sigma]$ is a prime; and this prime
is p since V < U. It follows that (iv) is a consequence of (i).

Assume finally the validity of (iv). Since $\theta$ contains the group of inner
automorphisms of G which is essentially the same as G/ZG, and since $\theta$ has
the Sylow Tower Property of supersoluble groups, G/ZG has likewise this
property. But this implies naturally that G itself enjoys the Sylow Tower
Property of supersoluble groups. Consider a p-Sylow subgroup P of G; and
denote by $\Gamma$ the group of automorphisms induced in P by $\theta_P$. Since $\phi P$ is a
characteristic subgroup of P, automorphisms preserving P will also preserve
$\phi P$. Hence $\phi P$ is $\theta_P$-admissible. Denote by $\Sigma$ a subgroup of $\theta_P$ which induces
in $P/\phi P$ a group $\Sigma^*$ of automorphisms whose order is prime to p. Since $\Sigma^*$
acts on the elementary abelian p-group $P/\phi P$, it is completely reducible
(Maschke’s Theorem) (6, p. 81, Theorem 46). This signifies that every
$\Sigma^*$-admissible subgroup of $P/\phi P$ possesses in $P/\phi P$ a $\Sigma^*$-admissible complement.
By (iv), maximal $\Sigma^*$-admissible subgroups of $P/\phi P$ have index p
in $P/\phi P$. Consequently $P/\phi P$ is the direct product of cyclic $\Sigma^*$-admissible
subgroups. Since cyclic subgroups of $P/\phi P$ have order p, and since the group
of automorphisms of a cyclic group of order p is cyclic of order $p-1$, it
follows that $\Sigma^*$ is abelian of exponent $p-1$. We recall the result of P. Hall
that an automorphism of the p-group P has order a power of p in case it
induces the identity in $P/\phi P$ (§ 1, Lemma (a)). Combining these results
we see that a subgroup of $\Gamma$ whose order is prime to p is abelian of exponent
$p-1$. If the order of $\Gamma$ is divisible by p, then p is the maximal prime divisor
of the order of $\Gamma$. By (iv), the group $\theta$, and consequently $\Gamma$ too, have the
Sylow Tower Property of supersoluble groups. This implies in particular the
existence of a characteristic p-Sylow subgroup $\Gamma_p$ of $\Gamma$. By Schur’s Theorem
there exists a complement A of $\Gamma_p$ in $\Gamma$. Since $A \cong \Gamma/\Gamma_p$ is of order prime
to p, A and consequently $\Gamma/\Gamma_p$ is abelian of exponent $p-1$. Hence $\Gamma$ is
strictly p-closed; and thus we have shown that (iii) is a consequence of (iv).
This completes the proof of the equivalence of conditions (i) to (iv).


THEOREM 3.2. G, θ is a supersoluble pair if, and only if, G/φG, θ is a 
supersoluble pair. 


Proof. The necessity of our condition is obvious. Assume that G/φG, θ is a 
supersoluble pair. Since this condition remains valid, if we adjoin the inner 
automorphisms of G to θ, we may assume without loss in generality that the 
inner automorphisms of G belong to θ. Consider a θ-admissible normal 
subgroup K of G. Then φ(G/K) = J/K where J is the intersection of all those 
maximal subgroups of G which contain K. This implies in particular that φG 
is part of J and that consequently K·φG/K ≤ φ(G/K). Thus (G/K)/φ(G/K) 
is an epimorphic image of G/φG. Since G/φG, θ is a supersoluble pair, 
(G/K)/φ(G/K), θ is likewise a supersoluble pair. Let H = G/K; and denote by 
Σ the group of automorphisms induced in G/K by θ. Since H/φH, Σ is a 
supersoluble pair, H/φH (and the group of automorphisms, induced by Σ in H/φH) 
are supersoluble. The supersolubility of H/φH implies the supersolubility 
of H; (7, p. 418, Satz 10). If p is the maximal prime divisor of the order of 
H, then H is p-closed and the p-Sylow subgroup P of H is a characteristic 
p-subgroup of H. Thus P is in particular Σ-admissible. It is a consequence 
of Theorem 2.2 (b) that φP = P ∩ φH. This implies that Σ induces essentially 
the same group of automorphisms in P/φP = P/(P ∩ φH) and in P·φH/φH. 
The latter group is the p-Sylow subgroup of H/φH and p is the maximal 
prime divisor of the order of H/φH. Since H/φH, Σ is a supersoluble pair, 
Σ induces in P·φH/φH, and hence in P/φP, a strictly p-closed group of 
automorphisms. Denote by Γ the group of automorphisms induced in P by 
automorphisms in Σ; and denote by Γ* the subgroup of those automorphisms 
in Γ which induce the identity automorphism in P/φP. By § 1, Lemma (a), 
the normal subgroup Γ* of Γ is a p-group. Since Γ/Γ* is essentially the same 
as the group of automorphisms induced in P/φP by Σ, and since the latter 
group is strictly p-closed, we see that Γ is an extension of a p-group by a 
strictly p-closed group. Hence Γ itself is strictly p-closed. Since P ≠ 1 is 
Σ-admissible, there exists a minimal Σ-admissible subgroup M of H which 
is part of P. Since θ, and hence Σ, contains every inner automorphism, M is 
a normal subgroup of H. Because of the minimality of M the group Σ induces 
in M an irreducible group A of automorphisms. Since A is likewise induced 
by Γ (because M ≤ P), A is strictly p-closed. Since M is a p-group, Theorem 
1.1 is applicable. Hence M is cyclic of order p. Thus we have established the 
existence of a cyclic, θ-admissible, normal subgroup M ≠ 1 of G/K. Hence 
G, θ is a supersoluble pair, q.e.d. 


4. The results obtained in § 3 will now be applied to the problem of super- 
soluble immersion. 


THEOREM 4.1. The following properties of the normal subgroup K of G are 
equivalent: 

(i) K is supersolubly immersed in G. 

(ii) K/φK is supersolubly immersed in G/φK. 

(iii) If U is a p-subgroup of K, then NU/CU is strictly p-closed. 

(iv) K has the Sylow Tower Property of supersoluble groups; and if P is a 
p-Sylow subgroup of K, then NP/CP is strictly p-closed. 

(v) G/CK has the Sylow Tower Property of supersoluble groups; and if P is 
a p-Sylow subgroup of K, S a subgroup of NP, then maximal S-normalized 
subgroups of P have index p in P. 


Proof. Denote by θ the group of automorphisms induced in K by elements 
in G. Then θ is essentially the same as G/CK. If U is a subgroup of K, then 
θ_U is just the group of all automorphisms of U which are induced in U by 
elements in the normalizer NU of U in G; and this shows that θ_U and NU/CU 
are essentially the same. Note finally that K is supersolubly immersed in 
G if, and only if, K, θ is a supersoluble pair. Since the inner automorphisms 
of K are clearly contained in θ, Theorems 3.1 and 3.2 may be applied, and a 
fairly obvious translation of these results proves the equivalence of properties 
(i) to (v). 


LEMMA 4.2. If K is a normal subgroup of G, and if the subgroup S of G is 
minimal with respect to the property G = KS, then K ∩ S ≤ φS, and S/φS is 
an epimorphic image of G/K. In particular S is supersoluble in case G/K is 
supersoluble. 






Proof. Consider a maximal subgroup T of S. If K ∩ S were not part of T, 
then we could deduce from the maximality of T (and the normality of K ∩ S 
in S) that S = (K ∩ S)T. Consequently 


G = KS = K(K ∩ S)T = KT. 


But T < S, contradicting the minimality of S. Thus we see that K ∩ S is 
part of every maximal subgroup of S; in other words: K ∩ S ≤ φS. Next we 
note the isomorphy G/K ≅ S/(K ∩ S). From K ∩ S ≤ φS we may deduce 
therefore that S/φS is an epimorphic image of G/K. Thus supersolubility of 
G/K implies the supersolubility of S/φS; and the latter implies, by Huppert's 
Theorem, the supersolubility of S. 


Remark. This lemma is, naturally, well known. It has been appended for 
the convenience of the reader. Note that the first part of the lemma has 
many applications of the type given in its second part, since there exist many 
group theoretical properties which, when satisfied by a group, are satisfied 
by its epimorphic images, and which, when satisfied modulo the Frattini 
subgroup, are satisfied by the group itself; for instance, nilpotency, dispersion, 
etc. 


THEOREM 4.3. The normal subgroup K of G is supersolubly immersed in G if, 
and only if, 

(a) G induces in K a supersoluble group of automorphisms and 

(b) the supersolubility of the subgroup S of G implies the supersolubility of KS. 


Proof. The necessity of these conditions we have pointed out before. If the 
conditions (a) and (b) are satisfied by the normal subgroup K of G, then we 
select among the subgroups X of G satisfying G = X·CK a minimal one, 
say S. Since the group of automorphisms, induced in K by elements in G, 
is essentially the same as G/CK, this group is supersoluble by (a). Application 
of Lemma 4.2 shows the supersolubility of S. Application of (b) shows 
the supersolubility of KS. Denote now by θ the group of automorphisms 
induced in K by elements in G. Because of G = S·CK the elements in KS 
induce in K the same group θ of automorphisms. Since KS is supersoluble, 
the pair K, θ is a supersoluble pair. But then clearly K is supersolubly 
immersed in G, q.e.d. 


We have pointed out before that the product Σ₁G of all the supersolubly 
immersed normal subgroups of G is itself a supersolubly immersed characteristic 
subgroup of G. If we denote by Σ₀G the product of all normal subgroups X of 
G such that XS is supersoluble whenever S is a supersoluble subgroup of G, then 
Σ₀G is a characteristic subgroup of G satisfying the same property (b) of 
Theorem 4.3. Denote finally by ΣG the intersection of all normal subgroups X 
of G with supersoluble G/X. Since direct products and subgroups of 
supersoluble groups are themselves supersoluble, ΣG is a characteristic subgroup 
of G with supersoluble quotient group G/ΣG. 












COROLLARY 4.4. Σ₁G = Σ₀G ∩ CΣG. 


Proof. If K is a supersolubly immersed normal subgroup of G, then we 
deduce K ≤ Σ₀G from Theorem 4.3 (b) and ΣG ≤ CK from Theorem 4.3 (a). 
The latter inequality implies K ≤ CΣG. Thus we have shown that 


Σ₁G ≤ Σ₀G ∩ CΣG. 


Let D = Σ₀G ∩ CΣG. Then D is a characteristic subgroup of G which 
satisfies D ≤ CΣG and hence ΣG ≤ CD, implying the validity of condition 
(a) of Theorem 4.3. From D ≤ Σ₀G we deduce the validity of condition (b) 
of Theorem 4.3. It follows that D is supersolubly immersed in G. Hence 
D ≤ Σ₁G, completing the proof. 

It is worth noting in this context that, in general, Σ₁G < Σ₀G, and that 
products of supersoluble normal subgroups will, in general, not be super- 
soluble. 

Slightly generalizing the concept of a supersoluble pair we term the pair 
G, θ (for θ a group of automorphisms of the group G) an almost supersoluble 
pair, if G, Σ is, for every supersoluble subgroup Σ of θ, a supersoluble pair. If the 
pair G, θ is almost supersoluble, then G, Σ is a supersoluble pair for every 
Sylow subgroup Σ of θ. The converse is false, as may be seen from the following 


Example. Let p be a prime, q an odd prime divisor of p − 1 (for instance, 
p = 7, q = 3). Then 2q is a factor of p − 1. There exists one and essentially 
only one non-abelian group θ of order 2q; and θ possesses a normal subgroup 
of order q and index 2, its only proper normal subgroup. It follows among 
other things that θ is supersoluble. There exists an elementary abelian p-group 
A of order p**; and there exists a group of automorphisms of A which is 
isomorphic to θ and which we shall denote by θ. Since Sylow subgroups of θ 
are cyclic of order q or 2, they are strictly p-closed. Hence every pair A, Σ, 
for Σ a Sylow subgroup of θ, is supersoluble. But θ itself is not strictly p-closed, 
though it is a group of automorphisms of the p-group A. Hence A, θ is not a 
supersoluble pair. 

The connection between the concept of an almost supersoluble pair and 
our preceding discussion is effected by the following fairly obvious remark: 
If K is a normal subgroup of G and θ the group of automorphisms induced 
in K by the elements in G, then the pair K, θ is almost supersoluble if, and 
only if, KS is supersoluble whenever S is a supersoluble subgroup of G 
(Theorem 4.3). 







REFERENCES 


1. R. Baer, Supersoluble groups, Proc. Amer. Math. Soc., 6 (1955), 16-32. 
2. R. Baer, Classes of finite groups and their properties, Illinois Jour. Math., 1 (1957), 115-187. 
3. R. Baer, Closure and dispersion, Illinois Jour. Math., 3 (1959), 000-000. 
4. P. Hall, A contribution to the theory of groups of prime power order, Proc. Lond. Math. Soc., 
36 (1932), 29-95. 
5. B. Huppert, Normalteiler und maximale Untergruppen endlicher Gruppen, Math. Zeit., 60 
(1954), 409-34. 
6. N. Jacobson, The Theory of Rings (New York, 1943). 
7. H. Zassenhaus, Lehrbuch der Gruppentheorie, I (Leipzig und Berlin, 1937). 


Frankfurt am Main 











NON-LINEAR RECURSIVE SEQUENCES 
ELBERT A. WALKER 


The purpose of this paper is to investigate non-linear recursive sequences 
of maximum length with elements from GF(2). In particular, the question 
of whether or not a recursive sequence of maximum length can be equal to 
its dual is settled. This question, as far as the author knows, was originally 
asked by Rosser. Part I contains the necessary background for Part II, and 
in the main is a condensation of some unpublished work (1955) of 
W. A. Blankinship and R. P. Dilworth. 


Part I 


1. Let GF(2) be the field with two elements, let $n$ be a positive integer, 
let $\mathcal{S}$ be the Cartesian product of $n$ copies of GF(2), and let $f$ be a mapping 
from $\mathcal{S}$ into GF(2). A sequence $a_1, a_2, a_3, \ldots$ of elements in GF(2) is said to 
be recursively generated by $f$ if 

$$a_{n+i} = f(a_i, a_{i+1}, \ldots, a_{n+i-1}), \qquad i = 1, 2, 3, \ldots.$$

$f$ is called a recursion or a rule of generation. The sequence $a_1, a_2, a_3, \ldots$ is 
called a recursive sequence of span $\le n$. It is of span $n$ if in addition it is not 
of span $\le n - 1$. The elements of $\mathcal{S}$ will be called patterns, or $n$-bit words, 
and the elements of GF(2) will sometimes be called bits, and denoted by 
0, 1. The mapping $f$ induces a mapping $F$ of $\mathcal{S}$ into $\mathcal{S}$ in the following way. 
If $S = (a_1, a_2, \ldots, a_n)$ is in $\mathcal{S}$, let $F(S) = (a_2, a_3, \ldots, a_n, f(a_1, \ldots, a_n))$. 
Hence with the mapping $f$ of $\mathcal{S}$ into GF(2), we associate the mapping $F$ of 
$\mathcal{S}$ into $\mathcal{S}$, and $F$ uniquely determines $f$. Distinct mappings $f_1$ and $f_2$ of $\mathcal{S}$ into 
GF(2) induce distinct mappings $F_1$ and $F_2$ of $\mathcal{S}$ into $\mathcal{S}$. If $F$ is one-to-one, 
then $f$ is said to be a non-singular recursion. Otherwise $f$ is called singular. 
All recursions considered here will be assumed to be non-singular unless 
otherwise stated. 


Received October 17, 1958. 


2. Let $f$ be a recursion, let $F$ be the mapping of $\mathcal{S}$ into $\mathcal{S}$ determined by 
$f$, and let $S_1$ be in $\mathcal{S}$. Let $S_i = F(S_{i-1})$, $i = 2, 3, 4, \ldots$. Since $\mathcal{S}$ is finite, 
there exists a smallest positive integer $m$ such that $S_{m+1} = S_1$. Thus $f$ generates 
a cycle of elements in $\mathcal{S}$, namely $(S_1, S_2, \ldots, S_m)$. If $T_1$ is some element in 
$\mathcal{S}$ not in the cycle $(S_1, S_2, \ldots, S_m)$, $f$ generates another cycle $(T_1, T_2, \ldots, T_k)$ 
and these two cycles are disjoint. Continuing in this manner, $\mathcal{S}$ is decomposed 
into disjoint cycles by $f$. Of course the cycle $(S_1, S_2, \ldots, S_m)$ is considered 
the same as $(S_2, S_3, \ldots, S_m, S_1)$. This collection of cycles determined 
by $f$ is denoted by $C$, and it is easy to see that $f$ is uniquely determined by $C$. 
The system $C$ of cycles is called the cyclic structure of $f$. Since $F$ is one-to-one, 
it is onto, and so is a permutation of $\mathcal{S}$. If this permutation is decomposed 
into the product of disjoint cycles, this collection of cycles is identical with 
$C$. Thus the cyclic structure of the permutation $F$ is identical with the cyclic 
structure of $f$. The sum of the lengths of the cycles in $C$ is $2^n$, where $n$ is the 
span of $f$. If $C$ consists of just one cycle, that cycle is said to be a maximal 
cycle. 
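The decomposition into cycles described in this section can be sketched in a few lines of Python. This is only an illustration under our own conventions, not code from the paper: patterns are tuples of bits, a recursion is any Python function from such a tuple to a bit, and the names `cycles`, `pure_cycling` and `example` are ours. The second recursion, $1 + x_1 + x_3 + x_2x_3$ for $n = 3$, is an example we chose because it generates a single cycle.

```python
from itertools import product

def cycles(f, n):
    """Decompose the n-bit patterns into the cycles of F(S) = (a_2..a_n, f(S)).

    Assumes f is non-singular, so that F is a permutation of the patterns.
    """
    seen, result = set(), []
    for start in product((0, 1), repeat=n):
        if start in seen:
            continue
        cycle, s = [], start
        while s not in seen:
            seen.add(s)
            cycle.append(s)
            s = s[1:] + (f(s),)   # apply F once
        result.append(cycle)
    return result

# f(x_1, x_2, x_3) = x_1 simply rotates each pattern.
pure_cycling = lambda s: s[0]
# f = 1 + x_1 + x_3 + x_2 x_3 (arithmetic in GF(2) written with XOR and AND).
example = lambda s: 1 ^ s[0] ^ s[2] ^ (s[1] & s[2])

print([len(c) for c in cycles(pure_cycling, 3)])
print([len(c) for c in cycles(example, 3)])
```

In both cases the cycle lengths add up to $2^3 = 8$, as stated above.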


3. Any mapping $f$ of $\mathcal{S}$ into GF(2), singular or non-singular, can be 
represented uniquely as a polynomial in $x_1, x_2, \ldots, x_n$ with coefficients in GF(2). If 
$f$ is linear in $x_1$, 

$$f(a_1, a_2, \ldots, a_n) = f(a_1 + 1, a_2, \ldots, a_n) + 1.$$

Hence $f$ linear in $x_1$ implies $f$ is non-singular. Assume $f$ is non-singular. Then 
$f(1, 0, 0, \ldots, 0) = 1 + f(0, 0, \ldots, 0)$ so that the term $x_1$ appears in the 
polynomial representing $f$. $f(1, 1, 1, \ldots, 1) = 1 + f(0, 1, 1, \ldots, 1)$ so that 
if the polynomial representing $f$ has any non-linear terms with $x_1$ as a factor, 
it has an even number of them. If it has at least two, let 

$$x_1 x_{i_1} x_{i_2} \cdots x_{i_r} \quad\text{and}\quad x_1 x_{j_1} x_{j_2} \cdots x_{j_s}$$

be distinct and let the first have smallest possible degree. The sets $\{i_1, i_2, \ldots, i_r\}$ 
and $\{j_1, j_2, \ldots, j_s\}$ are distinct. There is a $j_t$ not in $\{i_1, i_2, \ldots, i_r\}$. Let 
$S = (a_1, a_2, \ldots, a_n)$ be the element of $\mathcal{S}$ whose first co-ordinate is 1, whose 
$i_1, i_2, \ldots, i_r$ co-ordinates are 1, and the rest of whose co-ordinates are 0. Let 
$S' = (a_1 + 1, a_2, a_3, \ldots, a_n)$. Then $F(S) = F(S')$, and $F$ is singular. Hence $f$ 
is linear in $x_1$. Thus $f$ is non-singular if, and only if, $f$ is dependent on $x_1$ and 
linear in $x_1$, and $f(x_1, x_2, \ldots, x_n) = x_1 + f_1(x_2, x_3, \ldots, x_n)$, where $f_1$ is a 
polynomial in $x_2, x_3, \ldots, x_n$. 
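The characterization of non-singular recursions can be checked exhaustively for small spans. The sketch below is our own illustration, not part of the paper; the helper names are hypothetical. It stores a recursion as a truth table and compares "F is one-to-one" with "flipping the first bit always flips f" (the latter is the same as $f = x_1 + f_1(x_2, \ldots, x_n)$) for all 256 mappings of span 3.

```python
from itertools import product

def is_nonsingular(table, n):
    """F is one-to-one exactly when the 2^n images F(S) are pairwise distinct."""
    images = {s[1:] + (table[s],) for s in product((0, 1), repeat=n)}
    return len(images) == 2 ** n

def is_x1_plus_f1(table, n):
    """f = x_1 + f_1(x_2, ..., x_n) exactly when changing a_1 always changes f."""
    return all(table[s] != table[(1 - s[0],) + s[1:]]
               for s in product((0, 1), repeat=n))

n = 3
patterns = list(product((0, 1), repeat=n))
tables = [dict(zip(patterns, values))
          for values in product((0, 1), repeat=2 ** n)]

agree = all(is_nonsingular(t, n) == is_x1_plus_f1(t, n) for t in tables)
nonsingular_count = sum(is_nonsingular(t, n) for t in tables)
print(agree, nonsingular_count)
```

The count equals the number of choices of $f_1$, namely $2^{2^{n-1}}$.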
4. Let $S = (a_1, a_2, \ldots, a_n)$ be a pattern in $\mathcal{S}$, and let $\bar{S} = (a_1 + 1, a_2, \ldots, a_n)$. 
Let $f_S$ be the mapping from $\mathcal{S}$ into GF(2) that is 1 only at $S$ and $\bar{S}$. 
Explicitly, 

$$f_S(x_1, \ldots, x_n) = \prod_{i=2}^{n} (1 + a_i + x_i).$$

Note that $f_S = f_{\bar{S}}$. Let $C$ be the system of cycles of $f$. Suppose $S$ and $\bar{S}$ are on 
distinct cycles in $C$. Let $(S_1, S_2, \ldots, S_k)$ be the cycle containing $S$, and let 
$(T_1, T_2, \ldots, T_m)$ be the cycle containing $\bar{S}$. For convenience, let $S_1 = S$ and 
$T_1 = \bar{S}$. Then the system $C'$ of cycles of $f + f_S$ consists of the cycle 
$(S_1, T_2, T_3, \ldots, T_m, T_1, S_2, S_3, \ldots, S_k)$ and the remaining cycles of $C$ unchanged. 
Suppose $S$ and $\bar{S}$ are on the same cycle $(S_1, S_2, \ldots, S_k)$ in $C$. Let $S_1 = S$ and 
$S_r = \bar{S}$. Then the system $C'$ of cycles of $f + f_S$ consists of the cycles 
$(S_1, S_{r+1}, \ldots, S_k)$, $(S_r, S_2, \ldots, S_{r-1})$, and the remaining cycles of $C$ unchanged. 
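The effect of adding $f_S$ can be observed numerically: each such addition either joins two cycles or splits one, so the number of cycles changes by exactly one. The following sketch is an illustration with our own helper names (`count_cycles`, `f_S`, `add`), using tuples of bits for patterns.

```python
from itertools import product

def count_cycles(f, n):
    """Number of cycles of F(S) = (a_2..a_n, f(S)) for a non-singular f."""
    seen, count = set(), 0
    for start in product((0, 1), repeat=n):
        if start not in seen:
            count += 1
            s = start
            while s not in seen:
                seen.add(s)
                s = s[1:] + (f(s),)
    return count

def f_S(S):
    """Product of (1 + a_i + x_i) for i >= 2: equals 1 only at S and S-bar."""
    return lambda x: int(x[1:] == S[1:])

def add(f, g):
    """Sum of two mappings in GF(2)."""
    return lambda x: f(x) ^ g(x)

n = 3
patterns = list(product((0, 1), repeat=n))
f = lambda x: x[0]                      # f = x_1, four cycles when n = 3
base = count_cycles(f, n)
changes = [count_cycles(add(f, f_S(S)), n) - base for S in patterns]
print(base, changes)

g = add(f, f_S((0, 0, 1)))              # one join already performed
g_base = count_cycles(g, n)
g_changes = [count_cycles(add(g, f_S(S)), n) - g_base for S in patterns]
print(g_base, g_changes)
```

For $f = x_1$ every pattern $S$ lies on a different cycle from $\bar{S}$, so every change is a join; for the modified recursion both joins and splits occur.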












5. Let $f$ be a recursion that generates a cycle $C_1 = (S_1, S_2, \ldots, S_k)$. Suppose 
this cycle $C_1$ has the property that if it contains the pattern $(a_1, a_2, \ldots, a_n)$ 
then it contains $(a_1 + 1, a_2, a_3, \ldots, a_n)$. Let $(b_1, b_2, \ldots, b_n)$ be any pattern. 
$C_1$ contains a pattern ending in $b_1$, $(a_1, a_2, \ldots, a_{n-1}, b_1)$. If this pattern is not 
followed in $C_1$ by $(a_2, a_3, \ldots, a_{n-1}, b_1, b_2)$, then the pattern 
$(a_1 + 1, a_2, \ldots, a_{n-1}, b_1)$, which is in $C_1$, is followed by $(a_2, a_3, \ldots, a_{n-1}, b_1, b_2)$. 
Continuing in this manner, one gets the pattern $(b_1, b_2, \ldots, b_n)$ in $C_1$. Hence the system 
of cycles of $f$ consists simply of the one cycle $C_1$. If a recursion $f$ generates 
more than one cycle, then every cycle it generates has the property that it 
contains a pattern $S$ such that it does not contain $\bar{S}$. Therefore $f + f_S$ generates 
one less cycle than does $f$. In general, if $f$ generates $k$ cycles, then there exist 
$k - 1$ patterns $S_1, \ldots, S_{k-1}$ such that 

$$f + \sum_{i=1}^{k-1} f_{S_i}$$

generates just one cycle. 
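The argument of this section is constructive, and a minimal sketch of it follows. It is our own illustration, with hypothetical names `cycles` and `join_all`: a recursion is stored as a truth table, and while more than one cycle remains we pick on the first cycle a pattern $S$ whose partner $\bar{S}$ lies elsewhere and add $f_S$, which amounts to flipping the table at $S$ and $\bar{S}$.

```python
from itertools import product

def cycles(table, n):
    """Cycles of F for the non-singular recursion given by a truth table."""
    seen, out = set(), []
    for start in product((0, 1), repeat=n):
        if start in seen:
            continue
        cyc, s = [], start
        while s not in seen:
            seen.add(s)
            cyc.append(s)
            s = s[1:] + (table[s],)
        out.append(cyc)
    return out

def join_all(table, n):
    """Add f_S terms until a single cycle remains; return (new table, joins)."""
    table, joins = dict(table), 0
    while True:
        cs = cycles(table, n)
        if len(cs) == 1:
            return table, joins
        members = set(cs[0])
        # Such an S exists on every cycle as long as there are several cycles.
        S = next(s for s in cs[0] if (1 - s[0],) + s[1:] not in members)
        S_bar = (1 - S[0],) + S[1:]
        table[S] ^= 1        # adding f_S changes f only at S and S-bar
        table[S_bar] ^= 1
        joins += 1

n = 4
start = {s: s[0] for s in product((0, 1), repeat=n)}   # f = x_1
final, joins = join_all(start, n)
print(len(cycles(start, n)), joins, len(cycles(final, n)[0]))
```

Starting from $f = x_1$ with $n = 4$, which has six cycles, five additions are needed, in agreement with the count $k - 1$ above.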
Part II 


An unsolved problem concerning non-linear recursive sequences is that of 
finding a large class of recursions which generate maximal cycles. It is known 
(1) that the number of such recursions of span $n$ is 

$$2^{2^{n-1} - n}.$$
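For small spans the count of recursions generating a maximal cycle can be confirmed by brute force. The sketch below is ours, not the paper's: it enumerates every $f_1$ as a truth table on $(x_2, \ldots, x_n)$, forms $f = x_1 + f_1$, and follows the all-zero pattern until it returns. The function names are hypothetical.

```python
from itertools import product

def is_maximal(f1, n):
    """True if f = x_1 + f_1 generates one cycle through all 2^n patterns."""
    zero = (0,) * n
    s = zero
    for steps in range(1, 2 ** n + 1):
        s = s[1:] + (s[0] ^ f1[s[1:]],)
        if s == zero:                 # first return to the starting pattern
            return steps == 2 ** n
    return False

def count_maximal(n):
    tails = list(product((0, 1), repeat=n - 1))
    return sum(is_maximal(dict(zip(tails, values)), n)
               for values in product((0, 1), repeat=2 ** (n - 1)))

for n in (2, 3, 4):
    print(n, count_maximal(n), 2 ** (2 ** (n - 1) - n))
```

The enumeration grows doubly exponentially, so it is practical only for very small $n$.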
We begin this section by deriving some elementary properties a recursion 
must have if it generates a maximal cycle. Later we define and investigate 
the reverse, the dual, and the reverse-dual of a recursion. 

1. Let $f$ be a recursion of span $n$ that generates a maximal cycle. Then 

$$f(x_1, x_2, \ldots, x_n) = 1 + x_1 + \phi(x_2, x_3, \ldots, x_n),$$

where $\phi(x_2, x_3, \ldots, x_n)$ is a polynomial with no constant term. 


Proof. Since $f$ generates a maximal cycle, every $n$-bit word must occur in 
that cycle. Thus the $n$-bit word $(0, 0, \ldots, 0)$ is not a rut, that is 
$f(0, 0, \ldots, 0) = 1$. 


2. Let $f$ be a recursion of span $n > 1$ that generates a maximal cycle. Then 
the polynomial $f(x_1, \ldots, x_n)$ that represents $f$ does not contain all the linear 
terms $x_2, \ldots, x_n$. 

Proof. The $n$-bit word $(0, 0, \ldots, 0)$ is followed by a 1. If $f(x_1, \ldots, x_n)$ has 
the term $x_n$, then the $(n+1)$-bit word $(0, 0, \ldots, 0, 1)$ is followed by 0, and 
if it also has the term $x_{n-1}$, this pattern is followed by 0, etc. If $f(x_1, \ldots, x_n)$ 
has all the linear terms $x_2, \ldots, x_n$, we get the sequence $0, 0, \ldots, 0, 1, 0, 0, \ldots, 0$ 
since $f(x_1, \ldots, x_n)$ contains the linear term $x_1$. But if $n > 1$, this 
implies that $f$ does not generate a maximal cycle. Therefore $f(x_1, \ldots, x_n)$ does 
not contain all those terms. 


3. Let $f$ be a recursion of span $n$ that generates a maximal cycle. Then 
the polynomial that represents $f$ has an even number of terms. 


Proof. The $n$-bit word $(1, 1, \ldots, 1)$ is followed by a 0. Hence $f(1, 1, \ldots, 1) = 0$ 
and so $f(x_1, \ldots, x_n)$ has an even number of terms. 
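The three properties just proved can be checked on every maximal recursion of small span. The following sketch is an illustration under our own conventions: $f_1$ is a truth table on $(x_2, \ldots, x_n)$, and its polynomial is obtained by the standard Möbius transform over GF(2) (the coefficient of a monomial is the XOR of the table over all inputs supported inside that monomial). The function names are ours.

```python
from itertools import product

def is_maximal(f1, n):
    zero = (0,) * n
    s = zero
    for steps in range(1, 2 ** n + 1):
        s = s[1:] + (s[0] ^ f1[s[1:]],)
        if s == zero:
            return steps == 2 ** n
    return False

def anf(f1, m):
    """Polynomial coefficients of f1, indexed by exponent vectors u."""
    return {
        u: sum(f1[v] for v in product((0, 1), repeat=m)
               if all(b <= a for a, b in zip(u, v))) % 2
        for u in product((0, 1), repeat=m)
    }

def properties_hold(n):
    m = n - 1
    tails = list(product((0, 1), repeat=m))
    units = [tuple(int(i == j) for j in range(m)) for i in range(m)]
    for values in product((0, 1), repeat=2 ** m):
        f1 = dict(zip(tails, values))
        if not is_maximal(f1, n):
            continue
        c = anf(f1, m)
        constant_is_one = c[(0,) * m] == 1                  # property 1
        misses_a_linear_term = not all(c[u] for u in units) # property 2
        even_terms = (1 + sum(c.values())) % 2 == 0         # property 3, counting x_1
        if not (constant_is_one and misses_a_linear_term and even_terms):
            return False
    return True

print(properties_hold(3), properties_hold(4))
```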


4. Definitions. One cycle is the reverse of another if either cycle can be 
obtained from the other by taking the bits in reverse order. One cycle is the 
dual of another if either can be found from the other by replacing all 0's by 
1's and all 1's by 0's. One cycle is the reverse-dual of another if it is the reverse 
of the dual (the same as the dual of the reverse) of the other. The recursion 
corresponding to the reverses of the cycles generated by $f$ is called the reverse 
of $f$, and is denoted by $Rf$. The recursion corresponding to the duals of the 
cycles generated by $f$ is called the dual of $f$, and is denoted by $Df$. The recursion 
corresponding to the reverse-dual of the cycles of $f$ is called the reverse-dual 
of $f$, and is denoted by $RDf$. 


5. It is fairly clear that the cyclic structure of $f$, $Rf$, $Df$, and $RDf$ are the 
same as far as the number of cycles in each and the lengths of cycles in each 
are concerned. In particular, if $f$ generates a maximal cycle, then so do $Df$, 
$Rf$, and $RDf$. 


6. Since every recursion $f$ can be represented by a polynomial, it is of 
some interest to determine the polynomials representing $Rf$, $Df$, and $RDf$ in 
terms of the one representing $f$. A moment's reflection shows that if 

$$f(x_1, x_2, \ldots, x_n) = x_1 + f_1(x_2, \ldots, x_n)$$

then 

$$Rf(x_1, x_2, \ldots, x_n) = x_1 + f_1(x_n, x_{n-1}, \ldots, x_2)$$

and 

$$Df(x_1, \ldots, x_n) = x_1 + f_1(1 + x_2, 1 + x_3, \ldots, 1 + x_n).$$

From these it follows then that 

$$RDf(x_1, \ldots, x_n) = x_1 + f_1(x_n + 1, \ldots, x_2 + 1).$$
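The formulas for $Rf$ and $Df$ can be tested directly against the definitions. In the sketch below, which is our own illustration with hypothetical helper names, a recursion is given by a Python function `f1` on the tuple $(x_2, \ldots, x_n)$. Reversing a cycle means that whenever $f$ sends $S$ to $T$, the reverse recursion must send the reversed $T$ to the reversed $S$; taking the dual means that the dual recursion must send the complement of $S$ to the complement of $T$.

```python
from itertools import product

def step(f1, s):
    """One application of F for f = x_1 + f_1(x_2, ..., x_n)."""
    return s[1:] + (s[0] ^ f1(s[1:]),)

def reverse_of(f1):
    """f_1 of Rf: arguments taken in the opposite order."""
    return lambda t: f1(t[::-1])

def dual_of(f1):
    """f_1 of Df: every argument complemented."""
    return lambda t: f1(tuple(1 - b for b in t))

def complement(s):
    return tuple(1 - b for b in s)

def formulas_agree(f1, n):
    rf1, df1 = reverse_of(f1), dual_of(f1)
    for s in product((0, 1), repeat=n):
        t = step(f1, s)
        if step(rf1, t[::-1]) != s[::-1]:
            return False
        if step(df1, complement(s)) != complement(t):
            return False
    return True

# Example for n = 3: f_1(x_2, x_3) = 1 + x_3 + x_2 x_3.
example_f1 = lambda t: 1 ^ t[1] ^ (t[0] & t[1])
print(formulas_agree(example_f1, 3))

# Every f_1 for n = 4, given as a truth table.
tails = list(product((0, 1), repeat=3))
all_ok = all(
    formulas_agree(dict(zip(tails, values)).__getitem__, 4)
    for values in product((0, 1), repeat=8)
)
print(all_ok)
```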


7. Since $f(x_1, \ldots, x_n) = x_1 + f_1(x_2, \ldots, x_n)$ implies that 

$$Rf(x_1, \ldots, x_n) = x_1 + f_1(x_n, \ldots, x_2),$$

we see that $f$ and $Rf$ agree on those patterns which are symmetric in the 
last $(n - 1)$ bits. 


8. Suppose a cycle $(S_0, S_1, \ldots, S_{k-1})$ is the same as its reverse. Suppose 
this cycle contains a pattern which is its own reverse, and for convenience 
let it be $S_0$. Let $S_i'$ be the reverse of the pattern $S_i$. Then $S_0 = S_0'$, $S_1 = S_{k-1}'$, 
$\ldots$, $S_j = S_{k-j}'$, $\ldots$. There is at most one $j$ such that $j = k - j$, namely 
$j = \tfrac{1}{2}k$. Therefore, if a cycle is the same as its reverse, then it contains at 
most two patterns which are their own reverses. If $k$ is odd, there is at most 
one such pattern, and as a matter of fact, exactly one such pattern. It is 
possible for a cycle of even length to be its own reverse and contain no pattern 
which is its own reverse. For example, the cycle ((01), (10)) is such a 
cycle. Now, using these facts we can prove the following theorem. 


THEOREM. If $f = Rf$, then $f$ generates at least $2^{[\frac{1}{2}(n+1)] - 1}$ cycles. If $n \ge 3$, then 
$f$ does not generate a maximal cycle. 


Proof. There are $2^{[\frac{1}{2}(n+1)]}$ $n$-bit words which are their own reverses. Since 
$f = Rf$, a cycle which contains one of these $n$-bit words is its own reverse. But 
a cycle that is its own reverse can contain at most two $n$-bit words that are 
their own reverses. Hence there must be at least $2^{[\frac{1}{2}(n+1)] - 1}$ cycles generated 
by $f$. If $n \ge 3$, $2^{[\frac{1}{2}(n+1)] - 1} \ge 2$, so that $f$ does not generate a maximal cycle. 
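The bound of the theorem can be checked exhaustively for small spans. The sketch below is our own illustration with hypothetical names: it counts the self-reverse $n$-bit words, then runs through every $f_1$ that is unchanged when its arguments are reversed (which is the condition $f = Rf$ by 6) and records the smallest number of cycles that occurs.

```python
from itertools import product

def count_cycles(f1, n):
    """Number of cycles of f = x_1 + f_1, with f_1 given as a truth table."""
    seen, count = set(), 0
    for start in product((0, 1), repeat=n):
        if start in seen:
            continue
        count += 1
        s = start
        while s not in seen:
            seen.add(s)
            s = s[1:] + (s[0] ^ f1[s[1:]],)
    return count

def self_reverse_words(n):
    return sum(1 for w in product((0, 1), repeat=n) if w == w[::-1])

def min_cycles_when_f_equals_Rf(n):
    tails = list(product((0, 1), repeat=n - 1))
    best = None
    for values in product((0, 1), repeat=2 ** (n - 1)):
        f1 = dict(zip(tails, values))
        if all(f1[t] == f1[t[::-1]] for t in tails):     # f = Rf
            c = count_cycles(f1, n)
            best = c if best is None else min(best, c)
    return best

for n in (3, 4):
    bound = 2 ** ((n + 1) // 2 - 1)
    print(n, self_reverse_words(n), min_cycles_when_f_equals_Rf(n), bound)
```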


9. We are now going to settle the question as to whether or not a cycle 
of maximal length can be equal to its dual. It has just been shown that if 
$n \ge 3$, a maximal cycle of span $n$ is not equal to its reverse. The corresponding 
statement is true for the dual of a maximal cycle, but the proof of it is a 
little more complicated. We begin with a lemma, of which no proof seems 
readily available in the literature. 


LEMMA. If $f(x_1, \ldots, x_n) = x_1$, then the number of cycles generated by $f$ is 
even, for $n > 2$. 

Proof. If $a_1 a_2 \ldots a_n$ is any pattern, then the sequence obtained beginning 
with this pattern is $a_1 a_2 \ldots a_n a_1 a_2 \ldots a_n \ldots$. Therefore to compute the 
number of cycles generated by $f$ is the same problem as computing the number 
of strings of beads of length $n$ that can be constructed using two kinds of 
beads, where two strings are considered the same if one is a rotation of the 
other. It is easily verified that this number is 

$$\sum_{d \mid n} \frac{F(d)}{d},$$

where $F(1) = 2$ and 

$$F(k) = 2^k - \sum_{r \mid k,\ r < k} F(r).$$

$F(d)$ is, in fact, the number of patterns that are equal to themselves at slides 
of multiples of $d$ only. Hence such a pattern and its slides contain exactly $d$ 
distinct patterns, from which it follows that $d^{-1}F(d)$ is the number of strings 
of beads $n$ long of this nature. Summing over all divisors $d$ of $n$ yields the 
total number of strings of beads. This number $G(n)$ we wish to show is even. 


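Observing that 

$$\sum_{d \mid n} F(d) = 2^n$$

and applying the Möbius inversion formula yields 

$$F(n) = \sum_{d \mid n} \mu(n/d)\, 2^d.$$

Therefore we get 

$$G(n) = \sum_{d \mid n} \frac{1}{d} \sum_{r \mid d} \mu(r)\, 2^{d/r}.$$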
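If $n$ is odd we see immediately that $F(d)$ and hence $d^{-1}F(d)$ is even for all 
$d \mid n$. Thus we need consider only the case where 

$$d = 2^a p_1^{a_1} p_2^{a_2} \cdots p_s^{a_s},$$

where $a > 0$, and $p_1, p_2, \ldots, p_s$ are distinct odd primes.* Now 

$$F(d) = \sum_{r \mid d} \mu(r)\, 2^{d/r}$$

and each non-vanishing term in the sum has a factor 

$$2^{\,2^{a-1} p_1^{a_1 - 1} p_2^{a_2 - 1} \cdots p_s^{a_s - 1}}.$$

Hence 

$$F(d) = m\, 2^{\,2^{a-1} p_1^{a_1 - 1} \cdots p_s^{a_s - 1}},$$

where $m$ is an integer. Since $d \mid F(d)$, 

$$p_1^{a_1} p_2^{a_2} \cdots p_s^{a_s} \mid m.$$

Put 

$$m = u\, p_1^{a_1} p_2^{a_2} \cdots p_s^{a_s},$$

where $u$ is an integer. Then 

$$\frac{F(d)}{d} = u\, 2^{\,(2^{a-1} p_1^{a_1 - 1} p_2^{a_2 - 1} \cdots p_s^{a_s - 1} - a)}.$$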


A necessary condition that $d^{-1}F(d)$ be odd is $a_1 = a_2 = \ldots = a_s = 1$ and 
$a = 1$ or 2; that is, if $d = 2p_1p_2 \cdots p_s$ or $d = 4p_1p_2 \cdots p_s$. We show in each 
of these cases that $d^{-1}F(d)$ is odd. Let $d = 2p_1p_2 \cdots p_s$. Then 

$$F(d) = 2^{d} \pm 2^{d_1} \pm 2^{d_2} \pm \ldots \pm 2,$$

where $d, d_1, d_2, \ldots, 1$ are all the divisors of $d$. Hence $F(d)$ is twice an odd 
number and since $d \mid F(d)$, $d^{-1}F(d)$ is odd. If $d = 4p_1p_2 \cdots p_s$ then since 

$$F(d) = \sum_{r \mid d} \mu(r)\, 2^{d/r}$$

and $\mu(r) = 0$ if 4 divides $r$, 

$$F(d) = 2^{d} \pm 2^{d_1} \pm 2^{d_2} \pm \ldots \pm 2^{2},$$

where $d, d_1, d_2, \ldots, 2$ are all the even divisors of $d$. Hence $F(d)$ is four times 
an odd number and since $d \mid F(d)$, $d^{-1}F(d)$ is odd. Now let 

$$n = 2^a p_1^{a_1} p_2^{a_2} \cdots p_s^{a_s}.$$

*The author wishes to thank the referee for furnishing a correct proof for this case. 


Since $n > 2$, either $s > 0$ or $a > 1$. If $s = 0$, $n = 2^a$, $a > 1$ and 

$$G(n) = \sum_{d \mid n} \frac{F(d)}{d} = \frac{F(1)}{1} + \frac{F(2)}{2} + \frac{F(4)}{4} + \sum_{r=3}^{a} \frac{F(2^r)}{2^r} = 2 + 1 + 3 + \text{(even numbers)}.$$

Hence $G(n)$ is even. If $s > 0$, $a = 0$, then each term of 

$$G(n) = \sum_{d \mid n} \frac{F(d)}{d}$$

is even. If $s > 0$, $a = 1$, then the divisors $d$, for which $d^{-1}F(d)$ is odd, are 
the numbers $d = 2q_1q_2 \cdots q_r$, where $q_1, q_2, \ldots, q_r$ is a subset of $p_1, p_2, \ldots, p_s$. 
The number of such divisors is $2^s$ so that $G(n)$ is even. Finally, if $s > 0$, $a > 1$, 
the divisors $d$, for which $d^{-1}F(d)$ is odd, are of the forms $d = 2q_1q_2 \cdots q_r$ or 
$d = 4q_1q_2 \cdots q_r$ where $q_1, q_2, \ldots, q_r$ is a subset of $p_1, p_2, \ldots, p_s$. The number 
of such divisors is $2^{s+1}$. Hence, again $G(n)$ is even. 
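The quantities in the proof are easy to compute, which gives a numerical check on the lemma. The sketch below is an illustration in our own notation: `F` follows the recurrence $F(1) = 2$, $F(k) = 2^k - \sum F(r)$ over proper divisors $r$ of $k$; `G` sums $F(d)/d$ over the divisors of $n$; and `G_brute` counts strings of beads directly by grouping the $n$-bit words under rotation.

```python
from functools import lru_cache
from itertools import product

@lru_cache(maxsize=None)
def F(k):
    """Number of k-bit patterns whose smallest period under rotation is k."""
    return 2 ** k - sum(F(r) for r in range(1, k) if k % r == 0)

def G(n):
    """Number of cycles generated by f = x_1 for span n."""
    return sum(F(d) // d for d in range(1, n + 1) if n % d == 0)

def G_brute(n):
    """Count classes of n-bit words under rotation, directly."""
    seen, count = set(), 0
    for w in product((0, 1), repeat=n):
        if w not in seen:
            count += 1
            for i in range(n):
                seen.add(w[i:] + w[:i])
    return count

print([G(n) for n in range(1, 9)])
print(all(G(n) == G_brute(n) for n in range(1, 11)))
print(all(G(n) % 2 == 0 for n in range(3, 13)))
```

The value for $n = 2$ is odd, which is why the lemma requires $n > 2$.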


10. Let $f$ and $g$ be recursions of span $n$. If $f$ and $g$ disagree on the $n$-bit 
word $S$, then from 4, Part I, we see that $f + f_S$ and $g$ agree on $S$ and $\bar{S}$ and 
on all patterns on which $f$ and $g$ agree. Thus the recursion $f$ may be changed 
into the recursion $g$ by adding a suitable set of $f_S$'s to $f$. From 5, Part I, we 
see that adding $h_S$ to any recursion $h$ changes the parity of the number of 
cycles generated by $h$. If $S = (a_1, a_2, \ldots, a_n)$ then 

$$h_S(x_1, \ldots, x_n) = \prod_{i=2}^{n} (1 + a_i + x_i),$$

and adding $h_S$ to $h$ adds the term $x_2x_3 \cdots x_n$, among other terms, to the 
polynomial representing $h$. If one adds an odd number of $h_S$'s to $h$ one adds 
the term $x_2x_3 \cdots x_n$, among other terms, to the polynomial representing $h$. 
Now let $f$ be the recursion such that $f(x_1, \ldots, x_n) = x_1$ and let $g$ be any 
recursion of the same span that generates an odd number of cycles. If $n > 2$, 
$f$ generates an even number of cycles, so to change $f$ into $g$ requires the adding 
of an odd number of $f_S$'s to $f$, and hence the adding of the term $x_2x_3 \cdots x_n$, 
among other terms, to the polynomial representing $f$, which is $x_1$. Hence the 
polynomial representing $g$ has the term $x_2x_3 \cdots x_n$. Conversely, if $g$ is any 
recursion of span $n$ such that the polynomial representing it has the term 
$x_2x_3 \cdots x_n$, then one must add an odd number of $f_S$'s to $f$ to get $g$, and this 
implies that $g$ generates an odd number of cycles. These remarks we sum up 
in the following theorem. 










THEOREM. A recursion of span $n > 2$ generates an odd number of cycles if, 
and only if, the polynomial representing it has the term $x_2x_3 \cdots x_n$. 


COROLLARY. If a recursion of span $n > 2$ generates a maximal cycle, then 
the polynomial representing it has the term $x_2x_3 \cdots x_n$. 
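The parity theorem lends itself to an exhaustive check for small spans. The sketch below is our own illustration with hypothetical names. It relies on a standard fact about polynomials over GF(2): the coefficient of the full product $x_2x_3 \cdots x_n$ in the polynomial of $f_1$ equals the sum (mod 2) of all values of $f_1$, so the term is present exactly when the truth table of $f_1$ contains an odd number of 1's.

```python
from itertools import product

def count_cycles(f1, n):
    """Number of cycles of f = x_1 + f_1, with f_1 given as a truth table."""
    seen, count = set(), 0
    for start in product((0, 1), repeat=n):
        if start in seen:
            continue
        count += 1
        s = start
        while s not in seen:
            seen.add(s)
            s = s[1:] + (s[0] ^ f1[s[1:]],)
    return count

def parity_theorem_holds(n):
    tails = list(product((0, 1), repeat=n - 1))
    for values in product((0, 1), repeat=2 ** (n - 1)):
        f1 = dict(zip(tails, values))
        has_top_term = sum(values) % 2 == 1    # coefficient of x_2 x_3 ... x_n
        odd_cycles = count_cycles(f1, n) % 2 == 1
        if odd_cycles != has_top_term:
            return False
    return True

print(parity_theorem_holds(3), parity_theorem_holds(4), parity_theorem_holds(2))
```

The statement fails for $n = 2$ (for instance $f = x_1$ generates three cycles), consistent with the hypothesis $n > 2$.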


11. We are now in a position to prove that if $n > 2$ and $f$ generates a 
maximal cycle, then $f \ne Df$. In fact, we will prove a more general result. 


THEOREM. If $n > 2$ and a recursion $f$ generates an odd number of cycles, 
then $f \ne Df$. 

Proof. From 6, Part II, it is easy to see that if the polynomial representing 
$f$ contains a term 

$$x_{i_1} x_{i_2} \cdots x_{i_r}$$

then this term is a term of the polynomial representing $Df$ if, and only if, 

$$x_{i_1} x_{i_2} \cdots x_{i_r}$$

is a factor of an odd number of terms of the polynomial representing $f$. Since 
$f$ generates an odd number of cycles, the polynomial representing it contains 
the term $x_2x_3 \cdots x_n$. If that polynomial contains a term besides 1, $x_1$, and 
$x_2x_3 \cdots x_n$, then it contains a term which is a factor of only itself and 
$x_2x_3 \cdots x_n$. That term then is a factor of an even number of terms, and 
therefore is not a term of the polynomial representing $Df$. If 

$$f(x_1, \ldots, x_n) = 1 + x_1 + x_2x_3 \cdots x_n \quad\text{or}\quad x_1 + x_2x_3 \cdots x_n$$

then 

$$Df(x_1, \ldots, x_n) = 1 + x_1 + (1 + x_2)(1 + x_3) \cdots (1 + x_n) \quad\text{or}\quad x_1 + (1 + x_2)(1 + x_3) \cdots (1 + x_n),$$

and is obviously not the same as $f(x_1, \ldots, x_n)$. Hence in any case, $f \ne Df$. 


COROLLARY. If $n > 2$ and $f$ is a recursion generating a maximal cycle, then 
$f \ne Df$. 


12. THEOREM. If $n$ is even and if $f$ is a recursion of span $n > 2$ which generates 
an odd number of cycles, then $Rf \ne Df$, and hence $f \ne RDf$. 


Proof. From 6, Part II, it follows that the polynomials representing the 
recursions $f$ and $Rf$ have the same structure with regard to the number of 
terms which are the product of a given number of variables. There are $n - 1$ 
possible terms which are the product of $n - 2$ variables, namely $x_3x_4 \cdots x_n$, 
$x_2x_4 \cdots x_n$, $\ldots$, $x_2x_3 \cdots x_{n-1}$. Since the polynomial representing $f$ contains 
the term $x_2x_3 \cdots x_n$, we see from the proof of the theorem in 11, Part II, 
that the polynomial representing $Df$ contains precisely those terms which 
are the product of $n - 2$ variables that the polynomial representing $f$ does 
not contain. Thus for $Rf$ to be equal to $Df$ it is necessary that $n - 1$ be even. 
If $n$ is even then $n - 1$ is odd so that $Rf \ne Df$ and $f \ne RDf$. 


COROLLARY. If $n > 2$ is even and $f$ generates a maximal cycle then $Rf \ne Df$ 
and $RDf \ne f$. 


For $n$ odd it can happen that $Rf = Df$. It happens in the case $n = 5$, as 
shown by the polynomial 

$$f(x_1, x_2, x_3, x_4, x_5) = 1 + x_1 + x_4 + x_5 + x_4x_5 + x_2x_3x_5 + x_2x_3x_4 + x_2x_3x_4x_5.$$
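The example can be examined by direct computation. The sketch below is an illustration with our own function names; it takes the polynomial to be $1 + x_1 + x_4 + x_5 + x_4x_5 + x_2x_3x_5 + x_2x_3x_4 + x_2x_3x_4x_5$ (the subscripts are read from a poorly printed line, so this reading is an assumption), forms $Rf$ and $Df$ by the formulas of 6, compares them on all inputs, and measures the length of the cycle through the all-zero pattern.

```python
from itertools import product

def f1(t):
    """f_1(x_2, x_3, x_4, x_5) for the span-5 example, arithmetic in GF(2)."""
    x2, x3, x4, x5 = t
    return (1 ^ x4 ^ x5 ^ (x4 & x5) ^ (x2 & x3 & x5)
            ^ (x2 & x3 & x4) ^ (x2 & x3 & x4 & x5))

def Rf1(t):
    """f_1 of the reverse: arguments in the opposite order."""
    return f1(t[::-1])

def Df1(t):
    """f_1 of the dual: every argument complemented."""
    return f1(tuple(1 - b for b in t))

def cycle_length_from_zero(g1, n):
    """Length of the cycle of f = x_1 + g_1 that contains (0, ..., 0)."""
    zero = (0,) * n
    s, steps = zero, 0
    while True:
        s = s[1:] + (s[0] ^ g1(s[1:]),)
        steps += 1
        if s == zero:
            return steps

tails = list(product((0, 1), repeat=4))
reverse_equals_dual = all(Rf1(t) == Df1(t) for t in tails)
differs_from_dual = any(f1(t) != Df1(t) for t in tails)
length = cycle_length_from_zero(f1, 5)
print(reverse_equals_dual, differs_from_dual, length)
```

A cycle length of 32 means that the recursion generates a maximal cycle, so the example also illustrates the corollary of 11: $f$ differs from $Df$ even though $Rf = Df$.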


REFERENCE 


1. N. de Bruijn, A combinatorial problem, Koninklijke Nederlandse Akademie van Weten- 
schappen, Proceedings, 49 (Part 2) (1946), 758-64. 


New Mexico State University 



















EXTREMAL PROPERTIES OF HERMITIAN MATRICES. II 
M. MARCUS, B. N. MOYLS, AND R. WESTWICK 


1. Introduction. Let $H$ be an $n$-square Hermitian matrix with eigenvalues 
$h_1 \ge h_2 \ge \ldots \ge h_n$. Fan (2) showed that 

$$\max \sum_{j=1}^{k} (Hx_j, x_j) = \sum_{j=1}^{k} h_j, \qquad \min \sum_{j=1}^{k} (Hx_j, x_j) = \sum_{j=1}^{k} h_{n-k+j}, \tag{1}$$

$k = 1, 2, \ldots, n$, where the max and min are taken over all sets of $k$ orthonormal 
(o.n.) vectors in unitary $n$-space $V_n$. Marcus and McGregor (3) have 
generalized this result in the case that $H$ is non-negative Hermitian. For 
vectors $x_1, \ldots, x_r$, $r \le n$, in $V_n$, let $x_1 \wedge x_2 \wedge \ldots \wedge x_r$ denote the Grassmann 
exterior product of the $x_i$; it is a vector in $V_m$, where 

$$m = \binom{n}{r}.$$

The $r$th compound of $H$ is a Hermitian transformation of $V_m$ defined by 

$$C_r(H)\, x_1 \wedge \ldots \wedge x_r = Hx_1 \wedge \ldots \wedge Hx_r.$$

For $1 \le r \le k \le n$, denote by $Q_{kr}$ the set of $\binom{k}{r}$ distinct sequences 
$\omega = \{i_1, \ldots, i_r\}$ of integers such that $1 \le i_1 < \ldots < i_r \le k$. For a set of vectors 
$x_1, \ldots, x_k$ in $V_n$, set 

$$x_\omega = x_{i_1} \wedge \ldots \wedge x_{i_r}.$$

Let 

$$g = g(x_1, \ldots, x_k) = \sum_{\omega \in Q_{kr}} (C_r(H)\, x_\omega, x_\omega), \tag{2}$$

and let $E_r(a_1, \ldots, a_k)$ be the $r$th elementary symmetric function of the numbers 
$a_1, \ldots, a_k$. Marcus and McGregor showed that 

$$\max g = E_r(h_1, \ldots, h_k), \qquad \min g = E_r(h_{n-k+1}, \ldots, h_n), \tag{3}$$

where the max and min are taken over all sets of $k$ o.n. vectors $x_1, \ldots, x_k$ in 
$V_n$. This result reduces to (1) when $r = 1$. In the present note we extend this 
result to the case where $H$ is an arbitrary Hermitian matrix. 


Received July 23, 1958. The work of the first author was supported in part by United States 
National Science Foundation Research Grant NSF-G 5416; that of the second author by the 
United States Air Force Office of Scientific Research, Air Research and Development Com- 
mand; that of the third author by the National Research Council of Canada. 




2. Results. 


THEOREM. Let $1 \le r \le k \le n$ and let $H$ be a Hermitian matrix with eigenvalues $h_1 \ge \dots \ge h_n$. Then

\[
\begin{aligned}
\max g &= \max_{0 \le s \le k} E_r(h_1, \dots, h_s, h_{n-k+s+1}, \dots, h_n),^{*} \\
\min g &= \min_{0 \le s \le k} E_r(h_1, \dots, h_s, h_{n-k+s+1}, \dots, h_n),
\end{aligned} \tag{4}
\]

where the max and min of $g$ are taken over all sets of $k$ o.n. vectors $x_1, \dots, x_k$ in $V_n$.
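The right-hand sides of (4) are finite computations, so they can be tabulated directly. The following is a minimal sketch, not part of the original note: the function names and the sample eigenvalues are hypothetical, and the list `h` is assumed to be sorted in non-increasing order.

```python
from itertools import combinations
from math import prod

def elem_sym(values, r):
    """r-th elementary symmetric function of a sequence of numbers."""
    return sum(prod(c) for c in combinations(values, r))

def extreme_candidates(h, k, r):
    """E_r(h_1..h_s, h_{n-k+s+1}..h_n) for s = 0..k (h sorted non-increasingly)."""
    n = len(h)
    # h[:s] is the initial segment, h[n-k+s:] the terminal segment of length k-s.
    return [elem_sym(h[:s] + h[n - k + s:], r) for s in range(k + 1)]

h = [1, 0, -2, -3]                      # hypothetical eigenvalues of H
vals = extreme_candidates(h, k=2, r=2)  # candidates for s = 0, 1, 2
print(vals, max(vals), min(vals))
```

For these sample values the maximum is taken at $s = 0$ rather than at $s = k$, which illustrates why (3) has to be replaced by (4) once negative eigenvalues are allowed.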
Proof. Let $L = L(x_1, \dots, x_k)$ denote the subspace spanned by the o.n. vectors $x_1, \dots, x_k$; and let $P$ be the orthogonal projection of $V_n$ into $L$. Then, since $P$ is Hermitian,

\[
\begin{aligned}
g(x_1, \dots, x_k) &= \sum_{\omega \in Q_{kr}} (C_r(H) x_\omega, C_r(P) x_\omega) \\
&= \sum_{\omega \in Q_{kr}} (C_r(PH) x_\omega, x_\omega) \\
&= \text{trace of } C_r(A) \\
&= E_r(\lambda_1, \dots, \lambda_k),
\end{aligned}
\]

where $A$ is the Hermitian transformation $PH$ restricted to $L$, and $\lambda_1 \ge \dots \ge \lambda_k$ are the eigenvalues of $A$. It is known (1, p. 33) that for $1 \le j \le k$,

\[ h_j \ge \lambda_j \ge h_{n-k+j}. \tag{5} \]

Let $R_k(h)$ be the set of real $k$-tuples $\lambda = (\lambda_1, \dots, \lambda_k)$, $\lambda_1 \ge \dots \ge \lambda_k$, satisfying the inequalities (5). Thus the values of $g$ are bounded by the extreme values of $E_r(\lambda) = E_r(\lambda_1, \dots, \lambda_k)$ as $\lambda$ ranges over $R_k(h)$. We shall discuss the maximum value of $E_r(\lambda)$ in the following lemmas. Corresponding results hold for the minimum. For the moment we restrict ourselves to the case in which the $h_i$ are distinct.


LEMMA 1. Let $h_1 > \dots > h_n$ be given real numbers. Let $1 \le r \le k \le n$, and let

\[ \gamma = \max_{\lambda \in R_k(h)} E_r(\lambda). \tag{6} \]

Then there exists $\mu \in R_k(h)$ such that

\[ E_r(\mu) = \gamma \tag{7} \]

and $\mu_1 > \dots > \mu_k$.

Proof. When $r = 1$, the unique solution of (7) is: $\mu_j = h_j$, $j = 1, \dots, k$. Hence suppose that $2 \le r \le k$.

Let $T_{kt}(h)$ be the set of $\lambda = (\lambda_1, \dots, \lambda_k) \in R_k(h)$ such that $E_r(\lambda) = \gamma$ and $\lambda_1 > \dots > \lambda_t$. Then $T_{k1}(h)$ is not void by the continuity of the elementary symmetric functions. Let $m$ be the largest integer such that $T_{km}(h)$ is not void. Then $m$ must equal $k$ for, if not, we shall show that there exists $\nu \in T_{k,m+1}(h)$. Suppose then that $\mu \in T_{km}(h)$, where

\[ \mu_1 > \dots > \mu_m = \dots = \mu_p > \mu_{p+1} \ge \dots \ge \mu_k. \tag{8} \]

From (5) and (8) we have

\[ h_m > h_{m+1} \ge \mu_{m+1} = \mu_m = \mu_{p-1} = \mu_p \ge h_{n-k+p-1} > h_{n-k+p}. \tag{9} \]

Furthermore,

\[
\begin{aligned}
E_r(\mu) &= \mu_m E_{r-1}(\hat{\mu}_m) + E_r(\hat{\mu}_m) \\
&= \mu_p E_{r-1}(\hat{\mu}_p) + E_r(\hat{\mu}_p),
\end{aligned} \tag{10}
\]

where $E_r(\hat{\mu}_j)$ means $E_r(\mu_1, \dots, \mu_{j-1}, \mu_{j+1}, \dots, \mu_k)$. (If $r = k$, $E_r(\hat{\mu}_j) = 0$.)

Now $E_{r-1}(\hat{\mu}_m) = E_{r-1}(\hat{\mu}_p) = 0$. For, if $E_{r-1}(\hat{\mu}_m) > 0$, then for $\mu' = (\mu_1, \dots, \mu_{m-1}, \mu_m + \delta, \mu_{m+1}, \dots, \mu_k)$,

\[ E_r(\mu') = (\mu_m + \delta) E_{r-1}(\hat{\mu}_m) + E_r(\hat{\mu}_m) > E_r(\mu) \]

for $\delta > 0$, and, by (8) and (9), $\mu' \in R_k(h)$ for $\delta$ sufficiently small. This contradicts (6). Similarly, if $E_{r-1}(\hat{\mu}_p) < 0$, $E_r(\mu'') > E_r(\mu)$ for $\mu'' = (\mu_1, \dots, \mu_{p-1}, \mu_p - \delta, \mu_{p+1}, \dots, \mu_k)$. Hence $E_r(\mu) = E_r(\hat{\mu}_m)$ is independent of $\mu_m$. Set $\nu_j = \mu_j$ for $j \ne m$, and choose $\nu_m > \mu_m$ so that $\nu_m \le h_m$ and $\nu_m < \mu_{m-1}$ (if $m > 1$). Then $\nu \in T_{k,m+1}(h)$.

*If $s = 0$ (or $k$) the initial (or terminal) segment is missing.


LEMMA 2. Under the hypotheses of Lemma 1,

\[ \gamma = \max_{0 \le s \le k} E_r(h_1, \dots, h_s, h_{n-k+s+1}, \dots, h_n). \tag{11} \]

Proof. Since the lemma is obviously true when $r = 1$, and also when $k = n$, suppose that $2 \le r \le k < n$. By Lemma 1, $T_{kk}(h)$ is not empty. Let $S_{kq}(h)$, $1 \le q \le k$, be the set of those $\lambda \in T_{kk}(h)$ for which $\lambda_j = h_j$, $j = 1, \dots, q$; and let $S_{k0}(h)$ be the set of $\lambda \in T_{kk}(h)$ for which $\lambda_1 < h_1$. Let $s$ be the largest integer such that $S_{ks}(h)$ is not empty. If $s = k$, there is nothing to prove. Otherwise let $\mu \in S_{ks}(h)$. Then

\[ \mu_j = h_{n-k+j}, \qquad j = s+1, \dots, k; \]

for, if not, we shall show that there exists $\nu \in S_{k,s+1}(h)$, contradicting the choice of $s$.

Let $t$ be the least integer greater than $s$ for which $\mu_t > h_{n-k+t}$. If $t = s + 1$, $h_t > \mu_t$ by the maximality of $s$; while if $t > s + 1$,

\[ h_t \ge h_{n-k+t-1} = \mu_{t-1} > \mu_t. \]

Thus

\[ h_t > \mu_t > h_{n-k+t}. \]

It follows that $E_{r-1}(\hat{\mu}_t) = 0$, since otherwise we could vary $\mu_t$ up or down to increase $E_r(\mu)$ (see (10)) while keeping $\mu$ in $T_{kk}(h)$.













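Thus

\[ E_r(\mu) = E_r(\hat{\mu}_t). \tag{12} \]

Set

\[
\begin{aligned}
\nu_j &= \mu_j, & j &= 1, \dots, s, && (\text{if } s > 0) \\
\nu_{s+1} &= h_{s+1}, \\
\nu_j &= \mu_{j-1}, & j &= s+2, \dots, t, && (\text{if } t > s+1) \\
\nu_j &= \mu_j, & j &= t+1, \dots, k, && (\text{if } k > t).
\end{aligned}
\]

In effect, $\mu_t$ is replaced by $h_{s+1}$, and the resulting $\mu_j$'s are re-indexed to restore the ordering. By (12), $E_r(\nu) = E_r(\mu)$. It is then a straightforward matter to verify that $\nu \in S_{k,s+1}(h)$. This completes the proof of the lemma.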
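Lemma 2 can be checked numerically on small cases by comparing (11) with a brute-force search over a rational grid in $R_k(h)$. The sketch below is only an illustration under stated assumptions: the names `grid_max` and `formula_max`, the grid resolution, and the sample values of $h$ are hypothetical, and a grid search is not a proof.

```python
from fractions import Fraction
from itertools import combinations, product
from math import prod

def elem_sym(values, r):
    """r-th elementary symmetric function of a sequence of numbers."""
    return sum(prod(c) for c in combinations(values, r))

def formula_max(h, k, r):
    """Right-hand side of (11); h is sorted in decreasing order."""
    n = len(h)
    return max(elem_sym(h[:s] + h[n - k + s:], r) for s in range(k + 1))

def grid_max(h, k, r, steps=8):
    """Maximum of E_r over a grid of ordered points satisfying (5)."""
    n = len(h)
    axes = []
    for j in range(k):  # 0-based j: h[n-k+j] <= lambda_j <= h[j]
        lo, hi = Fraction(h[n - k + j]), Fraction(h[j])
        axes.append([lo + (hi - lo) * t / steps for t in range(steps + 1)])
    best = None
    for lam in product(*axes):
        if all(lam[i] >= lam[i + 1] for i in range(k - 1)):  # keep the ordering
            v = elem_sym(lam, r)
            if best is None or v > best:
                best = v
    return best

h = [1, 0, -2, -3]  # hypothetical distinct eigenvalues
print(grid_max(h, 2, 2), formula_max(h, 2, 2))
```

Exact rational arithmetic is used so that the comparison with the formula involves no rounding.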

We are now in a position to complete the proof of the theorem. If the eigenvalues of $H$ are distinct, then for o.n. $x_1, \dots, x_k$,

\[
g(x_1, \dots, x_k) \le \max_{\lambda \in R_k(h)} E_r(\lambda) = E_r(h_1, \dots, h_s, h_{n-k+s+1}, \dots, h_n)
\]

for some $s$, $0 \le s \le k$. Now $g$ attains this value for o.n. eigenvectors $y_1, \dots, y_k$ corresponding to $h_1, \dots, h_s, h_{n-k+s+1}, \dots, h_n$, respectively. Thus

\[ \max g = \max_{0 \le s \le k} E_r(h_1, \dots, h_s, h_{n-k+s+1}, \dots, h_n). \]

A similar result holds for the minimum. That these results remain valid when the eigenvalues of $H$ are not all different follows by a continuity argument.


REFERENCES 


1. R. Courant and D. Hilbert, Methods of mathematical physics, vol. 1 (New York, 1953).

2. Ky Fan, On a theorem of Weyl concerning eigenvalues of linear transformations, I, Proc. N.A.S. (U.S.A.), 35 (1949), 652-5.

3. M. Marcus and J. L. McGregor, Extremal properties of Hermitian matrices, Can. J. Math., 8 (1956), 524-31.


The University of British Columbia 











LINEAR TRANSFORMATIONS ON ALGEBRAS OF 
MATRICES: THE INVARIANCE OF THE 
ELEMENTARY SYMMETRIC FUNCTIONS 


MARVIN MARCUS AND ROGER PURVES 


1. Introduction. In this paper we examine the structure of certain linear transformations $T$ on the algebra of $n$-square matrices $M_n$ into itself. In particular if $A \in M_n$ let $E_r(A)$ be the $r$th elementary symmetric function of the eigenvalues of $A$. Our main result states that if $4 \le r \le n - 1$ and $E_r(T(A)) = E_r(A)$ for $A \in M_n$ then $T$ is essentially (modulo taking the transpose and multiplying by a constant) a similarity transformation:

\[ T : A \to SAS^{-1}. \]

No such result as this is true for $r = 1, 2$ and we shall exhibit certain classes of counterexamples. These counterexamples fail to work for $r = 3$ and the structure of those $T$ such that $E_3(T(A)) = E_3(A)$ for all $A \in M_n$ is unknown to us. In (1) it is established that those $T$ which preserve the rank (determinant) of every matrix in $M_n$ are essentially of the form $T : A \to PAQ$ where $P$ and $Q$ are non-singular ($PQ$ is unimodular). In the first part of what follows, we shall improve this result by requiring only that $T$ preserves non-singularity. We remark that in general we do not assume that $T$ is multiplicative or anti-multiplicative anywhere in the paper.
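For experiments it is convenient to compute $E_r(A)$ without finding eigenvalues, using the standard fact that $E_r(A)$ equals the sum of the principal $r$-square minors of $A$. The following sketch is an illustration only; the helper names and the sample matrices are hypothetical. It checks that a similarity and the transpose leave every $E_r$ unchanged.

```python
from itertools import combinations

def det(m):
    """Determinant by Laplace expansion along the first row (small n only)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))

def E(a, r):
    """Sum of the principal r-by-r minors of a, i.e. E_r of the eigenvalues."""
    return sum(det([[a[i][j] for j in w] for i in w])
               for w in combinations(range(len(a)), r))

def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(a):
    return [list(col) for col in zip(*a)]

A = [[2, -1, 0], [4, 3, 5], [1, 0, -2]]       # hypothetical test matrix
S = [[1, 2, 0], [0, 1, 3], [0, 0, 1]]         # hypothetical unimodular S
S_inv = [[1, -2, 6], [0, 1, -3], [0, 0, 1]]   # its exact inverse
similar = matmul(matmul(S, A), S_inv)
for r in (1, 2, 3):
    print(r, E(A, r), E(similar, r), E(transpose(A), r))
```

Integer matrices with an integer inverse are chosen so that all comparisons are exact.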

We shall collect here the notation to be used throughout. For $A \in M_n$ let $A'$ = transpose of $A$, $\rho(A)$ = rank of $A$, $\operatorname{tr}(A)$ = trace of $A$, $A_{ij}$ = the element in position $(i, j)$ of $A$, $0_n$ = the $n$-square zero matrix, and $E_{ij}$ = the $n$-square matrix with 1 at position $(i, j)$, 0 elsewhere. In addition if $A \in M_p$ and $B \in M_q$ we define $A \oplus B \in M_{p+q}$ to be the direct sum of $A$ and $B$. If $1 \le p \le n$ then $Q_{pn}$ will be the set of all sequences of $p$-tuples $\omega = (i_1, \dots, i_p)$ where $1 \le i_1 < i_2 < \dots < i_p \le n$. A transformation $T : M_n \to M_n$ will be called a direct product if there exists a scalar $c$ and fixed $U$ and $V$ in $M_n$ such that

\[ T(A) = cUAV \]

or

\[ T(A) = cUA'V \]

for all $A \in M_n$. This is motivated by the fact that the mapping $T : A \to UAV$ has a matrix representation $V' \times U$, the direct product of $V'$ and $U$, with a proper choice of co-ordinate system for $M_n$. We remark that the mapping $T : A \to A'$ cannot be accomplished by pre- and post-multiplication by fixed matrices $U$ and $V$ for all $A$. We shall also denote by e.v.$(A)$ the set of all $n$ eigenvalues of $A$ counting multiplicities.

Received October 17, 1958. The work of the first author was supported by United States National Science Foundation Grant NSF-G5416. The work of the second author was supported by the National Research Council of Canada.

2. Linear maps of $GL_n$ into itself. As usual, $GL_n$ is the group of $n$-square non-singular matrices in $M_n$. We shall determine all $T$ such that $T(GL_n) \subseteq GL_n$.


LEMMA 2.1. If $0 \ne A \in M_n$ then $A$ is similar to a matrix $B$ with $B_{ii} \ne 0$, $i = 1, \dots, n$.

Proof. We may assume $A$ is in Jordan form. It is known in general that $A$ is similar to a matrix with $\operatorname{tr}(A)/n$ in position $(i, i)$, $i = 1, \dots, n$. Hence we may assume $\operatorname{tr}(A) = 0$. If $A = E_{12}$ let $u_1$ be the vector with all entries 1 and let $u_2$ be the vector with first entry $1 - n$ and the remaining entries 1. Normalize $u_1$ and $u_2$ and let $u_3, \dots, u_n$ be a completion to an orthonormal basis. Let $U$ be the orthogonal matrix with $u_i$ as column $i$. Then the $(i, i)$ entry of $U E_{12} U'$ is $u_{i1} u_{i2} \ne 0$. The proof is now completed by induction on $n$. If $A \in M_{n+1}$ is in Jordan form with zero trace we consider first the case that $A$ is diagonal. Since $A \ne 0$ we can assume $A_{11} \ne 0$ and moreover the matrix $C \in M_n$ obtained by deleting row and column 1 of $A$ is not $0_n$. By induction choose $V \in M_n$ such that $(VCV^{-1})_{ii} \ne 0$ for $i = 1, \dots, n$. Then

\[ (1 \oplus V) A (1 \oplus V^{-1}) = A_{11} \oplus VCV^{-1} \]

has all non-zero diagonal elements. If $A$ is not diagonal we can clearly assume $A_{12} = 1$ and the submatrix $C$ above is not $0_n$. As before we select $V \in M_n$ such that

\[ P = (1 \oplus V) A (1 \oplus V^{-1}) \]

has all non-zero entries on the diagonal with the possible exception of $P_{11}$. If $P_{11} = 0$ and $d_{11}$ is the $(1, 1)$ entry of $VCV^{-1}$ then select $U \in M_2$ such that

\[ U \begin{pmatrix} 0 & P_{12} \\ P_{21} & d_{11} \end{pmatrix} U^{-1} \]

has non-zero diagonal entries. Then

\[ B = (U \oplus I_{n-1}) P (U^{-1} \oplus I_{n-1}) \]

is the required matrix.
LEMMA 2.2. If $0 \ne A \in M_n$ then there is a $Z \in M_n$ such that

\[ \text{e.v.}(A + Z) \cap \text{e.v.}(Z) = \emptyset. \]

Proof. By Lemma 2.1 choose $P \in M_n$ such that $(P^{-1}AP)_{ii} \ne 0$ for $i = 1, \dots, n$. Let $X$ be defined as follows:

\[
\begin{aligned}
X_{ii} &= 1, & i &= 1, \dots, n, \\
X_{ij} &= -(P^{-1}AP)_{ij}, & i &> j, \\
X_{ij} &= 0, & i &< j.
\end{aligned}
\]

Then $X$ has all $n$ eigenvalues 1 and $P^{-1}AP + X$ has eigenvalues $1 + (P^{-1}AP)_{ii}$, $i = 1, \dots, n$, none of which are 1. Then $Z = PXP^{-1}$ has the required property.
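The construction in the proof of Lemma 2.2 is easy to trace on a concrete matrix. The sketch below assumes, for simplicity, that $A$ already has non-zero diagonal entries, so that $P = I_n$ and $Z = X$; the helper names and the sample matrix are hypothetical.

```python
def build_X(a):
    """X with 1 on the diagonal, -a[i][j] below it, and 0 above it."""
    n = len(a)
    return [[1 if i == j else (-a[i][j] if i > j else 0) for j in range(n)]
            for i in range(n)]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

A = [[2, 5, -1], [3, -4, 7], [6, 1, 1]]   # hypothetical; diagonal entries non-zero
X = build_X(A)
A_plus_X = add(A, X)

n = len(A)
# A + X is upper triangular, so its eigenvalues are its diagonal entries.
is_upper = all(A_plus_X[i][j] == 0 for i in range(n) for j in range(i))
ev_X = {X[i][i] for i in range(n)}              # X is lower triangular
ev_sum = {A_plus_X[i][i] for i in range(n)}     # equals 1 + A[i][i]
print(is_upper, ev_X, ev_sum, ev_X.isdisjoint(ev_sum))
```

Because both matrices are triangular, the eigenvalue sets can be read off the diagonals and no eigenvalue routine is needed.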


LEMMA 2.3. If $T(GL_n) \subseteq GL_n$ then $T$ is non-singular.

Proof. We have that if

\[ \det(xI_n - [T(I_n)]^{-1} T(A)) = 0 \]

for some $x$ then

\[ \det(xI_n - A) = 0 \]

for that $x$. In other words the distinct elements of e.v.$([T(I_n)]^{-1}T(A))$ form a subset of the distinct eigenvalues of $A$. Now suppose $0 \ne A \in M_n$ and $T(A) = 0$. Choose $Z \in M_n$ by Lemma 2.2 such that

\[ \text{e.v.}(Z) \cap \text{e.v.}(A + Z) = \emptyset. \]

Then

\[ [T(I_n)]^{-1} T(A + Z) = [T(I_n)]^{-1} T(Z) \]

and the distinct eigenvalues of $[T(I_n)]^{-1}T(Z)$ form a subset of the distinct eigenvalues of both $A + Z$ and $Z$. This shows that $A = 0$ if $T(A) = 0$ and $T$ is non-singular.


LEMMA 2.4. If $T(GL_n) \subseteq GL_n$ and $T(I_n) = I_n$ then e.v.$(T(A))$ = e.v.$(A)$ for all $A \in M_n$.

Proof. As in the proof of Lemma 2.3, we know that if $T(A)$ has a set of $n$ distinct eigenvalues then

\[ \text{e.v.}(A) = \text{e.v.}(T(A)). \]

Since $T^{-1}$ exists we can say that if $B$ has $n$ distinct eigenvalues then

\[ \text{e.v.}(B) = \text{e.v.}(T^{-1}(B)). \]

If $T(A)$ has multiple eigenvalues choose a sequence $B_n$ converging to $T(A)$ such that $B_n$ has distinct eigenvalues. The proof is completed using the fact that the eigenvalues depend continuously on the elements.


THEOREM 2.1. If $T(GL_n) \subseteq GL_n$ then there exist $U$ and $V$ in $GL_n$ such that either

\[ T : A \to UAV \quad \text{for all } A \in M_n \]

or

\[ T : A \to UA'V \quad \text{for all } A \in M_n. \]

Proof. By Lemma 2.4 the map

\[ \varphi : A \to [T(I_n)]^{-1} T(A) \]

satisfies

\[ \text{e.v.}(\varphi(A)) = \text{e.v.}(A) \]

for all $A \in M_n$. But by (1: Theorem 2),

\[ \varphi(A) = UAU^{-1} \]

or

\[ \varphi(A) = UA'U^{-1}. \]

Multiplication on the left by $T(I_n)$ completes the proof.

3. Linear maps preserving the symmetric functions. We now determine the structure of those linear $T$ on $M_n$ to $M_n$ such that for each $A \in M_n$

\[ E_r(A) = E_r(T(A)). \]

For each $r$ let the class of all such $T$ be denoted by $\mathfrak{A}_r$. It is clear that if $T, S \in \mathfrak{A}_r$ then $TS \in \mathfrak{A}_r$. Also if $T \in \mathfrak{A}_r$ and $T^{-1}$ exists then $T^{-1} \in \mathfrak{A}_r$; for since any $B$ is in the range of $T$ we have

\[ E_r(B) = E_r(TT^{-1}(B)) = E_r(T^{-1}(B)). \]

Our first result shows that $\mathfrak{A}_r$ is actually a multiplicative group for $r \ge 2$.

LEMMA 3.1. If $r \ge 2$ and $T \in \mathfrak{A}_r$ then $T^{-1}$ exists. Thus $\mathfrak{A}_r$ is a multiplicative group for $r \ge 2$.

Proof. Suppose $T(A) = 0_n$ and $A \ne 0_n$. Then

\[ E_r(A + X) = E_r(T(A + X)) = E_r(T(X)) = E_r(X) \]

for any $X \in M_n$. By Lemma 2.1 there exists $P \in GL_n$ such that $(P^{-1}AP)_{ii} \ne 0$ for $i = 1, \dots, n$. Define $X \in M_n$ as follows:

\[
\begin{aligned}
X_{ii} &= x, & i &= 1, \dots, r-1, \\
X_{ii} &= 0, & i &= r, \dots, n, \\
X_{ij} &= 0, & i &< j, \\
X_{ij} &= -(P^{-1}AP)_{ij}, & i &> j.
\end{aligned}
\]

Then

\[ f(x) = E_r(P^{-1}AP + X) = E_r(A + PXP^{-1}) = E_r(PXP^{-1}) = E_r(X) = 0. \]

Thus the coefficient of $x^{r-1}$ in the polynomial $f(x)$ must be 0. This means that the sum of the last $n - r + 1$ entries on the main diagonal of $P^{-1}AP$ is 0. Similarly we can show that the sum of any $n - r + 1$ is 0. But since $r \ge 2$, $n - r + 1 < n$ and it is clear that $(P^{-1}AP)_{ii} = 0$ $(i = 1, \dots, n)$. This completes the proof.










LEMMA 3.2. If $A \in M_n$ and $A \ne 0_n$ then

\[ \deg \det(xA + B) \le 1 \quad \text{for all } B \in M_n \]

if, and only if, $\rho(A) = 1$.

Proof. We can clearly assume that $A$ is in Jordan canonical form and the "if" part of the result is obvious.

In the other direction we show first that $A$ has at most one non-zero eigenvalue. Suppose

\[ \lambda_{i_1}, \dots, \lambda_{i_k} \]

are the non-zero eigenvalues of $A$ in positions $(i_t, i_t)$, $t = 1, \dots, k$. Let $B$ be a diagonal matrix with 0 at positions $(i_t, i_t)$, $t = 1, \dots, k$, and 1 elsewhere on the main diagonal. Then

\[ \deg \det(xA + B) = k \le 1. \]

Suppose now that $A$ has the single non-zero eigenvalue $\lambda$ which we may assume is in position $(1, 1)$. To show that $\rho(A) = 1$ it will suffice to show that the elements along the superdiagonal of $A$ are all 0. This is clear for $n = 2$. If $n > 2$ let $\alpha$ be the largest integer such that there is a 1 at position $(\alpha, \alpha + 1)$ of $A$. Define $B$ as follows:

\[
\begin{aligned}
B_{ii} &= 0 && \text{if } i = \alpha, \alpha + 1, \\
B_{ii} &= 1 && \text{if } i \ne \alpha, \alpha + 1, \\
B_{\alpha+1,\alpha} &= 1, \\
B_{ij} &= 0 && \text{elsewhere.}
\end{aligned}
\]

Then

\[ \det(xA + B) = -\lambda x^2 - x. \]

Thus there must be a 0 at $(\alpha, \alpha + 1)$ and a repetition of this procedure shows that there are no 1's along the superdiagonal when $\lambda \ne 0$.

Now assume that $\lambda = 0$ and that the $(1, 2)$ entry of $A$ is 1. Define $\alpha$ as above and if $\alpha > 2$ define $B$ as follows:

\[
\begin{aligned}
B_{ii} &= 0, && i = 1, 2, \alpha, \alpha + 1, \\
B_{ii} &= 1 && \text{elsewhere on the main diagonal,} \\
B_{21} &= B_{\alpha+1,\alpha} = 1, \\
B_{ij} &= 0 && \text{elsewhere off the main diagonal.}
\end{aligned}
\]

Then

\[ \det(xA + B) = x^2. \]

In this way all elements $(i, i + 1)$ for $2 < i \le n - 1$ are shown to be 0. To settle position $(2, 3)$ use the test matrix

\[ B = E_{31} \oplus I_{n-3} \]

for $E_{31} \in M_3$. This completes the proof.













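LEMMA 3.3. If $3 \le r \le n$ and $A \in M_n$, $A \ne 0_n$, then the condition

\[ \deg E_r(xA + B) \le 1 \]

for all $B \in M_n$ implies that $A$ has at most one non-zero eigenvalue.

Proof. We can again assume $A$ is in Jordan canonical form with eigenvalues $\lambda_1, \dots, \lambda_n$. Let $z_1, \dots, z_n$ be indeterminates and let $B$ be the diagonal matrix with $B_{ii} = z_i$, $i = 1, \dots, n$. Then

\[
\begin{aligned}
E_r(xA + B) &= \sum_{\omega = (i_1, \dots, i_r) \in Q_{rn}} \prod_{k=1}^{r} (\lambda_{i_k} x + z_{i_k}) \\
&= \sum_{t=0}^{r} \Bigl( \sum_{\omega \in Q_{rn}} \sum_{s_t \subset \omega} \prod_{\alpha \in s_t} \lambda_\alpha \prod_{\beta \in \omega - s_t} z_\beta \Bigr) x^t,
\end{aligned}
\]

where $\sum_{s_t \subset \omega}$ means the sum over all subsets $s_t$ of $\omega$ with $t$ members and $\prod_{\beta \in \omega - s_t}$ means the product over those elements of $\omega$ not in $s_t$. Hence for $t \ge 2$ we have that the coefficient of $x^t$ in the above sum must be 0 for any choice of $z_1, \dots, z_n$. From this it is not difficult to show that the $t$th elementary symmetric function of any $n - r + t$ of the $\lambda_i$ is 0. Choosing $t = 2$ we have that if all the $\lambda_i$ are equal they must all be 0. Assume then that for some $\mu, \sigma$, $\lambda_\mu \ne \lambda_\sigma$. Since $r \ge 3$ we have that $k = n - r + 2 < n$. Let

\[ \lambda_{i_1}, \dots, \lambda_{i_{k-1}} \]

be a choice of $k - 1$ of the eigenvalues with $i_j \ne \sigma, \mu$ for $j = 1, \dots, k - 1$. Then

\[ 0 = E_2(\lambda_\sigma, \lambda_{i_1}, \dots, \lambda_{i_{k-1}}) = \lambda_\sigma E_1(\lambda_{i_1}, \dots, \lambda_{i_{k-1}}) + E_2(\lambda_{i_1}, \dots, \lambda_{i_{k-1}}) \]

and a similar relation holds for $\lambda_\mu$. We then have

\[ (\lambda_\sigma - \lambda_\mu) E_1(\lambda_{i_1}, \dots, \lambda_{i_{k-1}}) = 0. \]

If $r > 3$ then $k - 1 < n - 2$ and this last relation implies that $\lambda_i = 0$ for $i \ne \sigma, \mu$. In this case

\[ \lambda_\sigma \lambda_\mu = 0 \]

and $A$ has at most one non-zero eigenvalue. To settle the case $r = 3$ let $E_t(\hat{\lambda}_j)$ denote the $t$th elementary symmetric function of all the $\lambda_i$ for $i \ne j$. We first note that

\[ E_2(\lambda_1, \dots, \lambda_n) = \lambda_j E_1(\hat{\lambda}_j) + E_2(\hat{\lambda}_j) = \lambda_j E_1(\hat{\lambda}_j). \]

Summing on $j$ we have

\[ n E_2(\lambda_1, \dots, \lambda_n) = 2 E_2(\lambda_1, \dots, \lambda_n) = 0. \]

Thus

\[ \lambda_j E_1(\hat{\lambda}_j) = 0. \]

Setting

\[ s = \sum_{j=1}^{n} \lambda_j \]

we have

\[ \lambda_j^2 = \lambda_j s, \qquad \lambda_j(\lambda_j - s) = 0 \]

and thus the non-zero eigenvalues of $A$ are all equal to $s$. This completes the case $r = 3$.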
LEMMA 3.4. Assume $4 \le r \le n + 3$ and let $A \in M_{n+3}$, $A \ne 0_{n+3}$. Then

\[ \deg E_r(xA + B) \le 1 \quad \text{for all } B \in M_{n+3} \]

if, and only if, $\rho(A) = 1$.

Proof. The "if" part of the theorem is clear. To prove the "only if" part we can assume $A$ is in Jordan canonical form and proceed by induction on $n$. For $n = 1$ or $r = n + 3$ Lemma 3.2 gives the result. Thus assume $r < n + 3$ and by Lemma 3.3 we know that $A$ has at most one non-zero eigenvalue $\lambda$ which we can assume is in position $(1, 1)$. Call the $(2, 3)$ entry $\epsilon$ (either 1 or 0). Define $B$ to be the matrix with 1 in position $(3, 2)$ and $r - 3$ 1's in any of the diagonal positions $(i, i)$ for $i > 3$, 0's elsewhere. Then

\[ E_r(xA + B) = -\lambda \epsilon x^2. \]

Consider first the situation in which $\lambda \ne 0$. Then $\epsilon = 0$ and row 2 and column 2 of $A$ are both zero. If we restrict $B$ to those matrices with row 2 and column 2 zero we can apply the induction hypothesis to conclude that the submatrix of $A$ obtained by deleting row 2 and column 2 has rank 1. Thus $\rho(A) = 1$ as well. In case $\lambda = 0$ let $\epsilon_1$ and $\epsilon_2$ be the $(1, 2)$ and $(n + 2, n + 3)$ entries of $A$ respectively. Define $B$ as follows:

\[
\begin{aligned}
B_{21} &= B_{n+3,n+2} = 1, \\
B_{ii} &= 1, && 3 \le i \le r - 2, \\
B_{ij} &= 0 && \text{elsewhere.}
\end{aligned}
\]

Then

\[ E_r(xA + B) = \epsilon_1 \epsilon_2 x^2 \]

and we may assume without loss of generality that $\epsilon_1 = 0$. But then we can apply the induction argument as before to obtain $\rho(A) = 1$.


LEMMA 3.5. If $4 \le r \le n$ and $T \in \mathfrak{A}_r$ and $\rho(A) = 1$ for $A \in M_n$ then $\rho(T(A)) = 1$.

Proof. Consider the polynomial $f_B(x) = E_r(xT(A) + B)$. Since $T^{-1} \in \mathfrak{A}_r$ we have $f_B(x) = E_r(xA + T^{-1}(B))$. Since $\rho(A) = 1$, $\deg f_B(x) \le 1$ for all $B$, and by Lemma 3.4 $\rho(T(A)) = 1$.

LEMMA 3.6. If $4 \le r \le n$ and $T \in \mathfrak{A}_r$ then for every $A \in M_n$

\[ \rho(T(A)) = \rho(A). \]

Proof. Let $\rho(A) = k$ and select $A_j$, $j = 1, \dots, k$, such that $\rho(A_j) = 1$ and

\[ A = \sum_{j=1}^{k} A_j. \]

Then by Lemmas 3.5 and 3.1

\[ \rho(T(A)) \le k = \rho(A) = \rho(T^{-1}(T(A))) \le \rho(T(A)). \]

We are now in a position to prove our main result concerning the structure of $\mathfrak{A}_r$.


THEOREM 3.1. If $4 \le r \le n - 1$ and $T \in \mathfrak{A}_r$ then there exist $U$ and $V$ in $M_n$ such that either

(i) $T : A \to UAV$ for all $A \in M_n$

or

(ii) $T : A \to UA'V$ for all $A \in M_n$

where

(iii) $UV = e^{i\theta} I_n$, $r\theta \equiv 0 \ (2\pi)$.

Proof. The existence of $U$ and $V$ satisfying (i) and (ii) is an immediate consequence of Lemma 3.6 and Theorem 2.1. It is clear that it suffices to show that $E_r(PB) = E_r(B)$ for all $B \in M_n$ implies that $P = e^{i\theta} I_n$ with $r\theta \equiv 0 \ (2\pi)$. Letting $C_r(B)$ denote the $r$th compound of $B$ we have

\[ \operatorname{tr} C_r(PB) = \operatorname{tr} C_r(B) \quad \text{for all } B \in M_n. \]

Hence

\[ \operatorname{tr}\{[C_r(P) - I_m]\, C_r(B)\} = 0. \]

This implies immediately that

\[ C_r(P) = I_m, \qquad m = \binom{n}{r}. \]

By the polar factorization theorem let $P = UH$ where $U$ is unitary and $H$ is positive definite Hermitian (p.d.h.). Then

\[ C_r(U) C_r(H) = I_m \]

implies that $C_r(U)$ is both unitary and p.d.h. Hence every eigenvalue of $C_r(U)$ is 1 and this in turn implies that every eigenvalue of $U$ is $e^{i\theta}$ for $r\theta \equiv 0 \ (2\pi)$. Similarly we show $H = I_n$ and the result is at hand.













4. The structure of $\mathfrak{A}_j$ for $j = 1, 2, 3$. At this point Theorem 3.1 together with the results in (1) completely settle the question of the structure of $\mathfrak{A}_r$ when $r \ge 4$. It is easy to construct singular $T \in \mathfrak{A}_1$ (map $A$ into the diagonal matrix $B$ with $B_{ii} = A_{ii}$). Thus not much can be said about $\mathfrak{A}_1$. In examining $\mathfrak{A}_2$ we are led to two kinds of counterexamples: (i) those transformations $S \in \mathfrak{A}_2$ which permute the entries of every $A \in M_n$ in some fixed way; (ii) those transformations $C \in \mathfrak{A}_2$ which map $A$ into $K \circ A$ where $K \in M_n$ and $K \circ A$ is the Hadamard product of $K$ and $A$ ($(K \circ A)_{ij} = K_{ij} A_{ij}$, $i, j = 1, \dots, n$). We shall show that there exist non-trivial examples of both types (i) and (ii) in $\mathfrak{A}_2$ but that no such examples exist in $\mathfrak{A}_3$. We remark here that Lemma 3.4 fails for $r = 3$; for take $A = E_{12} + E_{34} \in M_4$ and note that although $E_3(xA + B)$ is at most linear in $x$ for $B \in M_4$, $\rho(A) = 2$. Thus there is no hope for proving Theorem 3.1 via Lemma 3.4 for $r = 3$.
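The remark that Lemma 3.4 fails for $r = 3$ can be spot-checked numerically. The sketch below is an illustration under stated assumptions: it takes $A = E_{12} + E_{34} \in M_4$, uses hypothetical helper names, draws test matrices $B$ from a fixed pseudo-random seed, and computes $E_3$ as the sum of principal 3-square minors. Since $E_3(xA + B)$ has degree at most 3 in $x$, vanishing second differences at $x = 0$ and $x = 1$ show that the degree is at most 1.

```python
import random
from itertools import combinations

def det(m):
    """Determinant by Laplace expansion along the first row (small n only)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))

def E(a, r):
    """Sum of the principal r-by-r minors of a."""
    return sum(det([[a[i][j] for j in w] for i in w])
               for w in combinations(range(len(a)), r))

def shifted(a, b, x):
    """The matrix x*a + b."""
    n = len(a)
    return [[x * a[i][j] + b[i][j] for j in range(n)] for i in range(n)]

def at_most_linear(a, b, r):
    """True if x -> E_r(x*a + b), of degree <= 3 here, has degree <= 1."""
    f = [E(shifted(a, b, x), r) for x in range(4)]
    return f[2] - 2 * f[1] + f[0] == 0 and f[3] - 2 * f[2] + f[1] == 0

A = [[0, 1, 0, 0], [0, 0, 0, 0], [0, 0, 0, 1], [0, 0, 0, 0]]  # E_12 + E_34
rng = random.Random(0)
trials = [[[rng.randint(-5, 5) for _ in range(4)] for _ in range(4)]
          for _ in range(20)]
linear_for_all = all(at_most_linear(A, B, 3) for B in trials)
# Non-zero 2-square minor (rows 1,3; columns 2,4); A has two non-zero rows, so rank 2.
rank_two_minor = A[0][1] * A[2][3] - A[0][3] * A[2][1]
print(linear_for_all, rank_two_minor)
```

The reason behind the check is that every principal 3-square submatrix of $xA + B$ contains $x$ in exactly one entry, so each such minor is linear in $x$.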

Denote by $S_r$ that subset of $\mathfrak{A}_r$ consisting of transformations that rearrange the elements of every $A \in M_n$ in some fixed way. Similarly, let $H_r$ denote that subset of $\mathfrak{A}_r$ consisting of transformations of the type $A \to K \circ A$, $K \in M_n$.


THEOREM 4.1. If $S \in S_2$ then $S = \sigma_1 \sigma_2 \sigma_3$ where

(i) $\sigma_3$ is a permutation of the main diagonal entries only.

(ii) $\sigma_2$ is a permutation of the set of pairs of entries symmetrically located across the main diagonal.

(iii) $\sigma_1$ interchanges symmetrically located entries.

The proof of Theorem 4.1 is a straightforward enumeration of the possibilities for images under $S$ of matrices of the types $E_{ii} + E_{jj}$, $i < j$, and $E_{ij} + E_{ji}$, $i < j$. We omit the details.


THEOREM 4.2. No element of $S_2$ of the types (i), (ii), (iii) in Theorem 4.1 is a direct product except the identity map and the transpose map.

Proof. This is done by showing that any map of the types $\sigma_1, \sigma_2, \sigma_3$ described in Theorem 4.1 maps some non-singular $N$ into a singular matrix. First, suppose $\sigma_3$ maps the $(j, j)$ entry into the $(i_j, i_j)$ entry. Choose a permutation $\pi$ of $1, \dots, n$ such that $\pi(j) = j$ and $\pi(i) \ne i$ for $i \ne j$. Let $N$ be the permutation matrix corresponding to $\pi$ and observe that $\sigma_3(N)$ is singular. Next, suppose $\sigma_2$ maps $(i, j)$ and $(j, i)$ into $(k, l)$ and $(l, k)$ respectively. Let

\[ N = E_{ij} + E_{ji} + \sum_{t \ne i, j} E_{tt} \]

and note that $N$ is non-singular and $\sigma_2(N)$ is singular. Next, suppose $\sigma_1$ interchanges $(i, j)$ and $(j, i)$ and leaves fixed $(k, l)$ and $(l, k)$. It is not difficult to exhibit non-singular $N \in M_3$ or $M_4$ for which $\sigma_1(N)$ is singular and we proceed to show that the examples in $M_n$ for $n > 4$ can be reduced to one of the cases $n = 3$ or $n = 4$. Suppose first that none of the equalities $i = k$, $i = l$, $j = k$, $j = l$ holds. Then set $N_1 = E_{ij} + E_{ji} + E_{kl} + E_{lk}$ and let the permutation $\pi$ of $1, \dots, n$ be $(1\ i)(2\ j)$ with corresponding permutation matrix $P$. Then $P\sigma_1(N_1)P' = E_{12} + E_{21} + E_{kl} + E_{lk}$. Similarly obtain a permutation matrix $Q$ such that $QP\sigma_1(N_1)P'Q' = E_{12} + E_{21} + E_{34} + E_{43}$. We are then confronted essentially with the case $n = 4$. If any of the equalities $i = k$, $i = l$, $j = k$, $j = l$ holds we can reduce the situation to the case $n = 3$ by a similar device.

We may describe the structure of $H_2$ as follows:

THEOREM 4.3. If $C \in H_2$, $C : A \to K \circ A$, then $K_{ij} = (K_{ji})^{-1}$ for $i \ne j$ and either $K_{ii} = 1$ $(i = 1, \dots, n)$ or $K_{ii} = -1$ for $i = 1, \dots, n$.

We omit the proof which consists of a straightforward consideration of the possibilities for the 2-square sub-determinants of $K$.
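The sufficiency direction of Theorem 4.3 follows from $E_2(A) = \sum_{i<j} (A_{ii}A_{jj} - A_{ij}A_{ji})$, and it can be illustrated on a small example. In the sketch below the matrix $K$, the test matrix $A$ and the helper names are hypothetical choices; exact rational arithmetic is used. The same $K$ does not preserve $E_3$, in line with the description of $H_3$ given later.

```python
from fractions import Fraction
from itertools import combinations

def det(m):
    """Determinant by Laplace expansion along the first row (small n only)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))

def E(a, r):
    """Sum of the principal r-by-r minors of a."""
    return sum(det([[a[i][j] for j in w] for i in w])
               for w in combinations(range(len(a)), r))

def hadamard(k, a):
    """Entrywise (Hadamard) product of k and a."""
    return [[x * y for x, y in zip(rk, ra)] for rk, ra in zip(k, a)]

# Hypothetical K with K_ii = 1 and K_ij * K_ji = 1 for i != j.
K = [[Fraction(1), Fraction(2), Fraction(3)],
     [Fraction(1, 2), Fraction(1), Fraction(5)],
     [Fraction(1, 3), Fraction(1, 5), Fraction(1)]]
A = [[2, -1, 4], [3, 0, 5], [-2, 7, 1]]
ones = [[1] * 3 for _ in range(3)]

print(E(A, 2), E(hadamard(K, A), 2))        # E_2 is preserved
print(E(ones, 3), E(hadamard(K, ones), 3))  # E_3 is not preserved by this K
```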

We remark at this point that it seems plausible that $\mathfrak{A}_2$ is generated by taking only products of elements of $S_2$, $H_2$ and maps of the form $A \to PAP^{-1}$, $P \in GL_n$. We have been unable to prove this, however.

The situations for $S_3$ and $H_3$ are somewhat more involved but we shall use a sequence of lemmas to show that:

$S_3$ consists only of the identity map, the transpose map, and maps of the form $A \to PAP'$ for $P$ a permutation matrix; $H_3$ consists only of the identity map and the map $A \to K \circ A = \theta DAD^{-1}$ where $D$ is a diagonal matrix and $\theta$ is a cube root of 1. It is not known to us whether there exist other elements of $\mathfrak{A}_3$ which are not direct products.


LEMMA 4.1. If $A \in M_n$ and $A$ has $n$ elements 1, the rest 0, then for $n > r \ge 1$,

\[ E_r(A) = \binom{n}{r} \]

if, and only if, $A = I_n$.

Proof. It is clear that since the $r$th order subdeterminants of $A$ are integers that

\[ E_r(A) \le \operatorname{tr}\{[C_r(A)][C_r(A)]'\}. \]

Hence

\[ E_r(A) \le \operatorname{tr} C_r(AA') = E_r(\alpha_1^2, \dots, \alpha_n^2) \]

where $\alpha_j^2$, $j = 1, \dots, n$, are the eigenvalues of $AA'$. If $\rho(A) = k$ and $k < r$ it is clear that

\[ 0 = E_r(A) < \binom{n}{r}. \]

Otherwise if $k \ge r$

\[
\begin{aligned}
E_r(A) \le E_r(\alpha_1^2, \dots, \alpha_n^2) &= E_r(\alpha_1^2, \dots, \alpha_k^2) \\
&\le \binom{k}{r} k^{-r} \{E_1(\alpha_1^2, \dots, \alpha_k^2)\}^r \\
&= \binom{k}{r} k^{-r} \{\operatorname{tr}(AA')\}^r = \binom{k}{r} k^{-r} n^r.
\end{aligned}
\]

We consider two cases:

(i) $k = n$. Then $A$ is a permutation matrix and all eigenvalues lie on the unit circle. Then it is easily seen that

\[ E_r(A) = \binom{n}{r} \]

implies all the eigenvalues are equal and the only permutation matrix with this property is $I_n$.

(ii) $k < n$. We shall show this is impossible. If $k = 1$, then $r = 1$ and $E_1(A) = \operatorname{tr}(A) = n$. But $I_n$ is the only matrix satisfying this and this is a contradiction. On the other hand, if $k \ge 2$ then

\[ E_r(A) \le \binom{k}{r} k^{-r} n^r < \binom{n}{r} = E_r(A) \]

and the proof is complete.
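For small $n$ the statement of Lemma 4.1 can be confirmed by exhaustive enumeration. The following sketch, with hypothetical names, runs through all 84 matrices in $M_3$ having exactly three entries 1 and the rest 0, and records those with $E_r(A) = \binom{3}{r}$ for $r = 1, 2$.

```python
from itertools import combinations
from math import comb

def det(m):
    """Determinant by Laplace expansion along the first row (small n only)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))

def E(a, r):
    """Sum of the principal r-by-r minors of a."""
    return sum(det([[a[i][j] for j in w] for i in w])
               for w in combinations(range(len(a)), r))

n = 3
cells = [(i, j) for i in range(n) for j in range(n)]
identity = [[1 if i == j else 0 for j in range(n)] for i in range(n)]

hits = {r: [] for r in range(1, n)}          # r ranges over 1 .. n-1
for ones in combinations(cells, n):          # choose the n positions holding a 1
    a = [[1 if (i, j) in ones else 0 for j in range(n)] for i in range(n)]
    for r in hits:
        if E(a, r) == comb(n, r):
            hits[r].append(a)

print(hits)
```

The restriction $r < n$ matters: for $r = n$ the condition reads $\det A = 1$, which every even permutation matrix satisfies.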


LEMMA 4.2. If $S \in S_3$ and $n \ge 4$ then $S$ either interchanges $(i, j)$ and $(j, i)$ for $i \ne j$ or leaves them fixed.

Proof. Since

\[ E_3(S(I_n)) = \binom{n}{3} \]

we have $S(I_n) = I_n$ by Lemma 4.1. Thus we may modify $S$ to obtain

\[ \sigma : A \to PS(A)P' \]

where $P \in M_n$ is such a permutation matrix that $\sigma$ holds the main diagonal elements fixed. Now let

\[ N_0 = 0_{n-2} \oplus J_2 \]

where

\[ J_2 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}. \]

We show first that $\sigma(N_0) = N_0$. If this were not the case we have two possible alternatives:

(i) $\sigma(N_0)$ has a 1 at some position $(k, l)$ such that $k < l$ and $(k, l) \ne (n - 1, n)$.

(ii) $\sigma(N_0)$ has a 1 at some position $(k, l)$ such that $k > l$ and $(k, l) \ne (n, n - 1)$.

In (i) let $D$ be a diagonal matrix in $M_{n-2}$ with 1 at $(k, k)$ and $(n - 3)$ zeros elsewhere on the diagonal. Then

\[ E_3(D \oplus J_2) = -1. \]

However $\sigma(D \oplus J_2)$ has at most two non-zero rows and hence

\[ E_3(\sigma(D \oplus J_2)) = 0. \]

In a similar way we eliminate the alternative (ii). Hence $\sigma$ either interchanges or leaves fixed the entries at $(n - 1, n)$ and $(n, n - 1)$. A similar argument for the other pairs of symmetrically located entries completes the proof.


LEMMA 4.3. If $S \in S_r$, $r \ge 2$, and

\[ S : A \to UAV \]

or

\[ S : A \to UA'V \]

then $U$ and $V$ are permutation matrices.

We omit the proof.

THEOREM 4.4. If $S \in S_3$ and $n = p + 2$, $p \ge 1$, then either

\[ S : A \to PAP' \quad \text{for all } A \in M_n \]

or

\[ S : A \to PA'P' \quad \text{for all } A \in M_n \]

where $P \in M_n$ is a permutation matrix.

Proof. The proof is by induction on the integer $p$. For $p = 1$ the result in (1, Theorem 2) shows that $S$ is a direct product (modulo taking the transpose), and Lemma 4.3 combined with the argument used in the latter part of the proof of Theorem 3.1 establishes that $S$ has the above form. Now we modify $S$ as in Lemma 4.2 to obtain $\sigma \in S_3$ where $\sigma$ holds diagonal elements fixed. Assume the result for all integers up to $p - 1$. Then if $C \in M_{n-1} = M_{(p-1)+2}$ we have by Lemma 4.2 that

\[ \sigma(0 \oplus C) = 0 \oplus \sigma(C) \]

and

\[
\begin{aligned}
E_3(\sigma(C)) &= E_3(0 \oplus \sigma(C)) = E_3(\sigma(0 \oplus C)) \\
&= E_3(0 \oplus C) = E_3(C).
\end{aligned}
\]

By the induction hypothesis and the fact that $\sigma$ holds the diagonal elements fixed we see that if we consider $\sigma$ as a mapping of $M_{n-1} \to M_{n-1}$ in the obvious way then

\[ \sigma : C \to C \quad \text{for all } C \in M_{n-1} \]

or

\[ \sigma : C \to C' \quad \text{for all } C \in M_{n-1}. \]

Now it is clear that if $A \in M_n = M_{p+2}$ and $C_i \in M_{n-1}$ is the principal submatrix obtained by deleting row and column $i$ of $A$ then the above argument shows that

\[ \sigma(C_i) = C_i \]

or

\[ \sigma(C_i) = C_i'. \]

Thus for each $A \in M_n$ it follows that

\[ \sigma(A) = A \]

or

\[ \sigma(A) = A', \]

and the proof is complete.


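THEOREM 4.5. If $C \in H_3$ then there exists $D \in M_n$ such that

\[ C : A \to \theta DAD^{-1} \quad \text{for all } A \in M_n \]

where $D$ is a diagonal matrix and $\theta^3 = 1$.

Proof. It suffices to show that there exist diagonal $U$ and $V$ in $M_n$ such that $C(A) = UAV$ or $C(A) = UA'V$, for then it is clear that $U_{ii} = \theta^{-1} V_{ii}^{-1}$ for $i = 1, \dots, n$ and $\theta^3 = 1$. Now for each $\omega \in Q_{3n}$ it is clear that we may consider $C$ as a mapping of $M_3 \to M_3$ by restricting $C$ to the principal submatrix of each $A \in M_n$ corresponding to the indices of $\omega$. Call the restricted mapping $C_\omega : M_3 \to M_3$; since $C_\omega$ preserves determinant it is a direct product:

\[ C_\omega : A \to U_\omega A V_\omega \quad \text{for } A \in M_3. \]

It is easy to check that $U_\omega$ and $V_\omega$ are diagonal by examining the images of $E_{ii} \in M_3$, $i = 1, 2, 3$, and using the fact that $C_\omega(A)$ is a Hadamard product. Thus on each 3-square principal submatrix $C$ has the desired form. It will clearly suffice to show that $C : A \to K \circ A$ has the property $\rho(K) = 1$. For then $K$ has the form $K_{ij} = a_i b_j$, $i, j = 1, \dots, n$. We show that every 2-square submatrix of $K$ is singular. Let $(\alpha_i \beta_j)$ denote the submatrix of $K$ involving rows $\alpha_1, \alpha_2$ and columns $\beta_1, \beta_2$. Suppose $\{\alpha_1, \alpha_2, \beta_1, \beta_2\}$ involves fewer than 4 distinct integers. Then it is clear that $(\alpha_i \beta_j)$ is a part of some principal 3-square submatrix whose row and column indices we will designate by

\[ \theta = \{\gamma_1, \gamma_2, \gamma_3\}. \]

By the above argument $C_\theta$ has the form

\[ C_\theta : A \to U_\theta A V_\theta, \quad A \in M_3, \]

where $U_\theta$ and $V_\theta$ are diagonal with diagonal elements $u_1, u_2, u_3$ and $v_1, v_2, v_3$ respectively. It follows that for some $i_1, i_2, j_1, j_2$ that

\[ K_{\alpha_s \beta_t} = u_{i_s} v_{j_t}, \quad s, t = 1, 2, \]

and hence that $(\alpha_i \beta_j)$ is singular. In case $\{\alpha_1, \alpha_2, \beta_1, \beta_2\}$ consists of 4 distinct integers we consider the two 3-square principal submatrices corresponding to

\[ \eta = \{\alpha_1, \alpha_2, \beta_1\} \quad \text{and} \quad \zeta = \{\alpha_1, \alpha_2, \beta_2\}. \]

Again we see that

\[
\begin{aligned}
C_\eta &: A \to U_\eta A V_\eta, \quad A \in M_3, \\
C_\zeta &: A \to U_\zeta A V_\zeta, \quad A \in M_3,
\end{aligned}
\]

where $U_\eta$, $U_\zeta$, $V_\eta$ and $V_\zeta$ are diagonal with main diagonals

\[ (u_1, u_2, u_3), \quad (u_1', u_2', u_3'), \quad (v_1, v_2, v_3), \quad (v_1', v_2', v_3') \]

respectively. We then obtain for some $i_1, j_1, i_2$,

\[
\begin{aligned}
K_{\alpha_1 \beta_1} &= u_{i_1} v_{j_1}, & K_{\alpha_1 \alpha_2} &= u_{i_1} v_{i_2}, \\
K_{\alpha_2 \beta_1} &= u_{i_2} v_{j_1}, & K_{\alpha_2 \alpha_2} &= u_{i_2} v_{i_2},
\end{aligned}
\]

and for some $m_1, m_2, n_2$,

\[
\begin{aligned}
K_{\alpha_1 \beta_2} &= u'_{m_1} v'_{n_2}, & K_{\alpha_1 \alpha_2} &= u'_{m_1} v'_{m_2}, \\
K_{\alpha_2 \beta_2} &= u'_{m_2} v'_{n_2}, & K_{\alpha_2 \alpha_2} &= u'_{m_2} v'_{m_2}.
\end{aligned}
\]

From these equalities we see that

\[ K_{\alpha_1 \beta_1} / K_{\alpha_2 \beta_1} = K_{\alpha_1 \beta_2} / K_{\alpha_2 \beta_2} \]

and again $(\alpha_i \beta_j)$ is singular.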


REFERENCE 


1. M. Marcus and B. N. Moyls, Linear transformations on algebras of matrices, Can. J. Math., 
11 (1959), 61-6. 


University of British Columbia 



















PRIME DUAL IDEALS IN BOOLEAN ALGEBRAS 


L. J. HEIDER 


1. Introduction. Let $\mathfrak{B}$ denote an arbitrary Boolean algebra. Let Latin letters $a, b, \dots$ denote general elements of $\mathfrak{B}$ while the symbols 0, 1 denote the special smallest and largest elements. Let Greek letters $\alpha, \beta, \dots$ denote various prime dual ideals of elements of $\mathfrak{B}$. It is recalled that a prime dual ideal of $\mathfrak{B}$ is a proper subset of $\mathfrak{B}$ closed under finite intersections of its elements and maximal with respect to those properties. Every prime dual ideal includes the element 1 and for each element $a$ of $\mathfrak{B}$ includes either $a$ or $\bar{a}$ (complement of $a$ in $\mathfrak{B}$) but not both. Occasional reference will be made to principal dual ideals of $\mathfrak{B}$. These are subsets of $\mathfrak{B}$ composed of all elements of $\mathfrak{B}$ majorizing some fixed non-zero element of $\mathfrak{B}$. Finally, let $X(\mathfrak{B})$ denote the collection of all prime dual ideals of $\mathfrak{B}$. Then, with the subsets $X(a) = [\alpha \in X(\mathfrak{B}) \mid a \in \alpha]$, $a \in \mathfrak{B}$, being used as a basis for open sets, the collection $X(\mathfrak{B})$ becomes (homeomorphic to) the Stone representation space for $\mathfrak{B}$.

The collection $X(\mathfrak{B})$, with its field of open-and-closed subsets, is primarily representative of the Boolean algebra $\mathfrak{B}$. Special field-related properties of particular algebras $\mathfrak{B}$ as, for example, the ability of $\mathfrak{B}$ to be represented as a quotient-field of sets, appear as special properties of the field $X(\mathfrak{B})$. However, the same collection $X(\mathfrak{B})$, with its compact, zero-dimensional, Hausdorff topology, may, with equal ease, be regarded as the Stone-Čech compactification space $\beta Y$ of a completely regular topological space $Y$. In this case, the algebra $\mathfrak{B}$ is provided by a basis of open-and-closed subsets of $Y$, and special properties of $Y$ appear as special properties of $X(\mathfrak{B})$ and $\mathfrak{B}$.

In either case, it is the points of $X(\mathfrak{B})$ that matter. These points are not undefined terms, but complex structures, that is, prime dual ideals of a Boolean algebra $\mathfrak{B}$. Any prime dual ideal $\alpha$ of $\mathfrak{B}$ has the property that if a finite union element $\bigvee_{i=1}^{n} a_i$ of $\mathfrak{B}$ is in $\alpha$, then some component element $a_i$ of this union is likewise in $\alpha$. This universal property of prime dual ideals may obviously be generalized. Let $\mathfrak{M}$ denote an infinite cardinal, and let $I$ denote an index set of cardinality $\mathfrak{M}$. Assume that a union element $a_0 = \bigvee_{i \in I} a_i$ exists in $\mathfrak{B}$. In general, a prime dual ideal of $\mathfrak{B}$ containing $a_0$ may or may not contain a component element of this union.

This paper discusses the presence in X (%) of prime dual ideals that contain 
along with a union element a) = V;.; @; also a component element a, of that 
union. The first result of this discussion is a unified theory of the use of X(B) 
in the representation of Boolean algebras 8. Since the parts of this theory 


Received December 16, 1957; in revised form October 15, 1958. This paper was prepared 
with the support of the National Science Foundation under Research Grant NSF-G3965. 










398 L. J. HEIDER 


have been developed by many authors, the present treatment is in outline
form. The emphasis is on the unity of theory achieved by use of the above
special property of prime dual ideals. The second result is a characterization
of the Boolean algebras $\mathfrak{B}$ for which the spaces $X(\mathfrak{B})$ may be
regarded as the Stone-Čech compactification spaces $\beta Y$ associated with three special
types of completely regular spaces $Y$, namely, the P-, P'- and U-spaces of (3, 4).
These special spaces were introduced because of the interest of the algebraic
features of their associated rings of real-valued continuous functions. Our
interest arose from the fact that for each space $Y$ of any of these types the
corresponding space $\beta Y$ is zero-dimensional and thus homeomorphic to the
representation space $X(\mathfrak{B})$ of a Boolean algebra $\mathfrak{B}$. In the cases of
the P- and P'-spaces, the points of $\beta Y = X(\mathfrak{B})$ corresponding to points in
$Y$ involve intriguing properties of prime dual ideals.


2. Boolean algebras and fields of sets. Let $\mathfrak{M}$ denote an arbitrary
cardinal number. Let the concepts of a field of sets, an $\mathfrak{M}$-field of sets and
an $\mathfrak{M}$-complete Boolean algebra be understood in the usual sense. An
$\mathfrak{M}$-complete Boolean algebra is called $\mathfrak{M}$-representable if it is
isomorphic to an $\mathfrak{M}$-field of sets modulo an $\mathfrak{M}$-complete ideal of
that field. An $\mathfrak{M}$-complete Boolean algebra $\mathfrak{B}$ is called
$\mathfrak{M}$-distributive if

$$\bigwedge_{i \in I} \bigvee_{j \in J} a_{ij} = \bigvee_{\varphi \in J^I} \bigwedge_{i \in I} a_{i\varphi(i)}$$

for each doubly-indexed family $\{a_{ij}\}$, $i \in I$, $j \in J$, of elements of
$\mathfrak{B}$ for which the cardinalities $\bar{I}$, $\bar{J}$ of the index sets do not
exceed $\mathfrak{M}$. Here $J^I$ indicates the family of all maps $\varphi$ with domain
$I$ and range in $J$.
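As an illustration only, the distributive identity can be checked by brute force in the algebra of all subsets of a small finite set, where it always holds. The sketch below is not part of the paper; the names `meet`, `join` and `distributive_law_holds` are hypothetical helpers, and the finite power-set setting is an assumption made so that the family $J^I$ of choice maps can be enumerated.

```python
from itertools import product


def meet(sets, universe):
    """Intersection of a family of subsets; the empty family gives the universe."""
    out = set(universe)
    for s in sets:
        out &= s
    return frozenset(out)


def join(sets):
    """Union of a family of subsets; the empty family gives the empty set."""
    out = set()
    for s in sets:
        out |= s
    return frozenset(out)


def distributive_law_holds(a, universe):
    """a[i][j] is a subset of universe, with i ranging over I and j over J."""
    I = range(len(a))
    J = range(len(a[0]))
    # Left side: meet over i of (join over j of a[i][j]).
    lhs = meet([join(a[i][j] for j in J) for i in I], universe)
    # Right side: join over all maps phi: I -> J of (meet over i of a[i][phi(i)]).
    rhs = join(
        meet([a[i][phi[i]] for i in I], universe)
        for phi in product(J, repeat=len(a))
    )
    return lhs == rhs


universe = frozenset(range(4))
family = [[{0, 1}, {2}], [{1, 2}, {3}]]
print(distributive_law_holds(family, universe))
```

The enumeration of `product(J, repeat=len(a))` plays the role of $J^I$; for infinite index sets no such enumeration is available, which is where the cardinal bound $\mathfrak{M}$ in the definition matters.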

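For any element $a_0$ of a given Boolean algebra $\mathfrak{B}$ let
$a_0 = \bigvee_{i \in I} a_i$, $\bar{I} \le \mathfrak{M}$, be called an
$\mathfrak{M}$-representation of the element $a_0$. Let

$$a_0 = \bigvee_{j \in J_i} a_{ij}, \quad i \in I, \quad \bar{I} \le \mathfrak{M}, \quad \bar{J}_i \le \mathfrak{M},$$

be called an $\mathfrak{M}$-family of $\mathfrak{M}$-representations of $a_0$. With this
terminology and these concepts at hand, the principal parts of the theory may be presented
in three statements.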

(A) The Boolean algebras that are isomorphic to $\mathfrak{M}$-fields of sets are the
$\mathfrak{M}$-complete algebras that have for every non-zero element a prime dual ideal
that contains a component of each $\mathfrak{M}$-representation of that element.

(B) The $\mathfrak{M}$-complete and $\mathfrak{M}$-distributive Boolean algebras are
exactly those $\mathfrak{M}$-complete algebras that have for each non-zero element and for
each $\mathfrak{M}$-family of $\mathfrak{M}$-representations of that element a principal
dual ideal containing a component of each member of that family.

(C) The $\mathfrak{M}$-complete and $\mathfrak{M}$-representable Boolean algebras are
exactly those $\mathfrak{M}$-complete algebras that have for each non-zero element and for
each $\mathfrak{M}$-family of $\mathfrak{M}$-representations of that element a prime dual
ideal containing a component of each member of that family.




























IDEALS IN BOOLEAN ALGEBRAS 399 


These statements are made without proof. Their intended value lies in the
unified treatment of diverse subjects that they provide. Statement (A) is an
observation of Sikorski (10) in dual form. Enomoto's theorems (2) regarding
$\mathfrak{M}$-fields of sets in the wider sense involve but slight rephrasing of this
statement. Statement (B) is well known (9, 11), but attention is here called to
the position of $\mathfrak{M}$-distributive algebras midway between $\mathfrak{M}$-fields
of sets and quotients of such fields by $\mathfrak{M}$-complete ideals. Statement (C) was
suggested by work of Chang (1), but is new at least in its simplicity.

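An apparent addition to the existing literature on the subject matter of
statement (C) may well be made here. Let $\mathfrak{B}$ be an $\mathfrak{M}$-complete
Boolean algebra with representation space $X(\mathfrak{B})$. Let
$\mathfrak{F}(\mathfrak{B})$ denote the $\mathfrak{M}$-field of subsets of
$X(\mathfrak{B})$ generated by the subsets of $X(\mathfrak{B})$ of the type
$X(a) = [\mathfrak{a} \in X(\mathfrak{B}) \mid a \in \mathfrak{a}]$, $a \in \mathfrak{B}$.
Let an element of $\mathfrak{F}(\mathfrak{B})$ of the form $\bigcap_{i \in I} X(a_i)$ with
$\bar{I} \le \mathfrak{M}$ and $\bigwedge_{i \in I} a_i = 0$ in $\mathfrak{B}$ be called
an $\mathfrak{M}$-nowhere dense subset of $X(\mathfrak{B})$. Let
$\mathfrak{J}(\mathfrak{B})$ denote the $\mathfrak{M}$-complete ideal in
$\mathfrak{F}(\mathfrak{B})$ generated by these $\mathfrak{M}$-nowhere dense subsets.
Attention is now called to the fact that, for each $\mathfrak{M}$-complete and
$\mathfrak{M}$-representable Boolean algebra $\mathfrak{B}$, the quotient
$\mathfrak{F}(\mathfrak{B})/\mathfrak{J}(\mathfrak{B})$ is a specific example of an
isomorphic representation of $\mathfrak{B}$ as the quotient of an $\mathfrak{M}$-field
of sets modulo an $\mathfrak{M}$-complete ideal.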


3. Fields of sets and topological spaces. The concept of a field of sets
stands midway between that of a Boolean algebra and that of a topological
space with a basis of open-and-closed subsets. Let $\mathfrak{M}$ denote an arbitrary
cardinal number. Let $\mathfrak{F}(X)$ be an $\mathfrak{M}$-field of subsets of a set $X$.
It will be assumed that $\mathfrak{F}(X)$ is reduced, that is, for $p \ne q$ in $X$ there
is an element $O$ of $\mathfrak{F}(X)$ with $p \in O$ and $q \notin O$. Let
$(X, \mathfrak{T})$ denote the set $X$ as under the topology $\mathfrak{T}$ obtained by
using the subsets of $X$ in $\mathfrak{F}(X)$ as a basis for open sets. Any subset of $X$
in $\mathfrak{F}(X)$ is open-and-closed in $(X, \mathfrak{T})$. However, there might be
subsets of $X$ not in $\mathfrak{F}(X)$ that are open-and-closed in $(X, \mathfrak{T})$.
They would be of the form

$$A = \bigcup_{i \in I} O_i = \bigcap_{j \in J} O_j$$

where the index sets $I$, $J$ are arbitrary and each $O_i$ and $O_j$ is an element of
$\mathfrak{F}(X)$. This introduction of alien open-and-closed subsets will be undesirable
for our purpose. Hence, a reduced $\mathfrak{M}$-field of sets $\mathfrak{F}(X)$ will be
called union-intersection closed if every subset $A$ of $X$ as described above is an
element of $\mathfrak{F}(X)$. With each reduced $\mathfrak{M}$-field there is associated a
minimal, reduced, union-intersection closed $\mathfrak{M}$-field including the given
field. It consists of all subsets $A$ as described above.

We now turn to the very special topological spaces described in (3, 4, 8).
As usual, for any topological space $Y$, $C(Y)$ will denote the collection of all
real-valued functions, defined and continuous on $Y$. For each element $f$ of
$C(Y)$, let $P(f) = [p \in Y \mid f(p) > 0]$ and $Z(f) = [p \in Y \mid f(p) = 0]$. Let
$\beta Y$ and $\upsilon Y$ denote, respectively, the Stone-Čech compactification space and
the Hewitt Q-space associated with a completely regular space $Y$. The first
special completely regular spaces to be considered are the P-spaces.

The P-spaces may be characterized in a number of different ways (3,
Theorem 5.3). For one thing, a completely regular space $Y$ is a P-space if,
and only if, every countable intersection of open sets of $Y$ is itself open in
$Y$. From this it follows that each P-space $Y$ is a zero-dimensional Hausdorff
space in which each countable intersection of open-and-closed subsets is
open-and-closed. Hence, dually, in a P-space any countable union of
open-and-closed subsets is likewise open-and-closed. Thus, if $Y$ is a P-space and
$\mathfrak{F}(Y)$ is the field of open-and-closed subsets of $Y$, then $\mathfrak{F}(Y)$
is a reduced, union-intersection closed, $\sigma$-field of sets in the sense explained
above.

Conversely, let $\mathfrak{F}(Y)$ be a reduced, union-intersection closed, $\sigma$-field
of sets. Use the subsets of $Y$ in $\mathfrak{F}(Y)$ as the basis of a topology
$\mathfrak{T}$ on $Y$, and let $(Y, \mathfrak{T})$ denote $Y$ with this topology.


THEOREM 3.1. If $\mathfrak{F}(Y)$ is a reduced, union-intersection closed, $\sigma$-field
of sets, then $(Y, \mathfrak{T})$ is a P-space and every P-space may be thus described.


Proof. With $\mathfrak{F}(Y)$ and $(Y, \mathfrak{T})$ as described, it is obvious that
$(Y, \mathfrak{T})$ is a zero-dimensional Hausdorff space and thus completely regular.
Consider, moreover, the intersection $\bigcap_n U_n$ of a countable family $\{U_n\}$ of
sets open in $(Y, \mathfrak{T})$. If $p_0$ is a point of $Y$ in this intersection, then
there exists a family $\{O_n\}$ of sets in $\mathfrak{F}(Y)$ with
$p_0 \in O_n \subset U_n$ for each $n$. Hence, with $\mathfrak{F}(Y)$ a $\sigma$-field,
there exists an element $O_0$ of $\mathfrak{F}(Y)$ with $p_0 \in O_0 \subset U_n$ for each
$n$. Thus any countable intersection of open subsets of $(Y, \mathfrak{T})$ is open, so
that $(Y, \mathfrak{T})$ is a P-space.

If, conversely, one begins with a P-space $Y$ and then forms $\mathfrak{F}(Y)$ and
$(Y, \mathfrak{T})$ as described, clearly $(Y, \mathfrak{T})$ is homeomorphic to $Y$.

With the P-spaces thus firmly linked to reduced, union-intersection closed,
$\sigma$-fields of sets, attention is turned elsewhere for the moment. First, two
additional facts (3, Theorem 5.3, (2) and (3)) concerning P-spaces are needed:
if $Y$ is a P-space, so likewise is $\upsilon Y$; if $Y$ is a P-space, then the zero-set
$Z(f)$ is open-and-closed in $Y$ for each element $f$ of $C(Y)$.

Now, for any completely regular space $Y$ and for any point $p_0$ in $Y$, let $p_0$
be called a P-point of $Y$ if for each element $f$ of $C(Y)$ there exists a
neighbourhood $U$ of $p_0$ in $Y$ such that $f(p) = f(p_0)$ for each point $p$ in $U$.
Then, from the facts cited just above, it follows that for any P-space $Y$ each point
of $\upsilon Y$ is a P-point of $\upsilon Y$. Next consider
$\beta Y = \beta(\upsilon Y)$. It is rather obvious that each P-point of $\upsilon Y$ as
imbedded in $\beta Y$ becomes a P-point of $\beta Y$. On the other hand, no point
$\bar{p}$ of $\beta Y - \upsilon Y$ as in $\beta Y$ is a P-point of $\beta Y$. Thus,
for each point $\bar{p}$ of this type, there is an element $f$ of $C(\beta Y)$ with
$f(\bar{p}) = 0$ while $f(p) > 0$ for all points $p$ of $\upsilon Y$ (5, Example 2.3).
This, of course, excludes the local constancy of $f$ at $\bar{p}$ since the points of
$\upsilon Y$ are dense in $\beta Y$. Thus, for any P-space $Y$, the points of
$\upsilon Y$ as imbedded in $\beta Y$ are identified with the P-points of $\beta Y$.







The fact that each zero-set $Z(f)$ associated with a P-space $Y$ is open-and-closed
in $Y$ indicates that for such spaces the sets $P(f)$ are likewise open-and-closed
in $Y$. Thence it follows (4, Theorem 8.3) that for any P-space $Y$
the lattice $C(Y)$ is conditionally countably complete, so that
$\beta Y = X(\mathfrak{B})$ where $\mathfrak{B}$ is a $\sigma$-complete Boolean algebra
(12). This algebra may, of course, be identified with the Boolean algebra of all
open-and-closed subsets of $\beta Y$ or, equivalently, of $\upsilon Y$ or even of $Y$
itself.


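4. P-spaces and Boolean algebras. Interest now turns to the P-points
of a space $X(\mathfrak{B})$ where $\mathfrak{B}$ is a $\sigma$-complete Boolean algebra.
Each point of $X(\mathfrak{B})$ is a prime dual ideal of $\mathfrak{B}$. Let
$\mathfrak{a}$ be such an ideal while $\mathfrak{M}$ is a cardinal number and $I$ is an
index set with $\bar{I} \le \mathfrak{M}$. We introduce two conditions:


$(\mathrm{I} - \mathfrak{M})$ If $\bigvee_{i \in I} a_i$ exists and is in $\mathfrak{a}$, $\bar{I} \le \mathfrak{M}$, then some $a_i$ is in $\mathfrak{a}$.
$(\mathrm{II} - \mathfrak{M})$ If $\{a_i, i \in I\} \subset \mathfrak{a}$, $\bar{I} \le \mathfrak{M}$, then $\bigwedge_{i \in I} a_i$ exists and is non-zero.


For $\mathfrak{M}$-complete Boolean algebras the two conditions are equivalent. For
any Boolean algebra, if condition $\mathrm{II} - \mathfrak{M}$ is satisfied with respect to
a particular prime dual ideal, then condition $\mathrm{I} - \mathfrak{M}$ is satisfied also.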

A Boolean algebra $\mathfrak{B}$ will be called a
$\mathfrak{B}(\mathrm{II} - \mathfrak{M}, D)$ algebra if the prime dual ideals of
$\mathfrak{B}$ satisfying condition $\mathrm{II} - \mathfrak{M}$ are dense in
$X(\mathfrak{B})$ or, equivalently, each non-zero element of $\mathfrak{B}$ is contained
in a prime dual ideal of $\mathfrak{B}$ satisfying this condition. For each
$\mathfrak{B}(\mathrm{II} - \mathfrak{M}, D)$ algebra $\mathfrak{B}$, let $D$ also denote
the subspace of $X(\mathfrak{B})$ consisting of all points (prime dual ideals) satisfying
condition $\mathrm{II} - \mathfrak{M}$. A similar definition and notation can be used for
$\mathfrak{B}(\mathrm{I} - \mathfrak{M}, D)$ algebras. Although reference is made to an
arbitrary cardinal number $\mathfrak{M}$, interest centers on the first infinite cardinal
number $\aleph_0 = \sigma$. Two lemmas are now in order.


LEMMA 4.2. Every $\mathfrak{B}(\mathrm{II} - \mathfrak{M}, D)$ Boolean algebra is $\mathfrak{M}$-complete.


LEMMA 4.3. For any $\mathfrak{B}(\mathrm{II} - \sigma, D)$ Boolean algebra $\mathfrak{B}$,
the P-points of the space $X(\mathfrak{B})$ are the prime dual ideals satisfying condition
$\mathrm{I} - \sigma = \mathrm{II} - \sigma$.


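The proof of Lemma 4.2 is brief. Let $\{a_i, i \in I\}$, $\bar{I} \le \mathfrak{M}$, be a
subset of elements of a $\mathfrak{B}(\mathrm{II} - \mathfrak{M}, D)$ algebra
$\mathfrak{B}$. If $\bigvee_{i \in I} a_i \ne 1$, there exists an element $a_0$ of
$\mathfrak{B}$, $a_0 \ne 0$, with $a_0 \le \bar{a}_i$ for all $i$ in $I$, where
$\bar{a}_i$ denotes the complement of $a_i$. However, for each non-zero element
$a_0$ of a $\mathfrak{B}(\mathrm{II} - \mathfrak{M}, D)$ algebra, there exists a prime dual
ideal $\mathfrak{a}_0$ of that algebra containing $a_0$ and in which condition
$\mathrm{II} - \mathfrak{M}$ is verified. Then
$\{\bar{a}_i, i \in I\} \subset \mathfrak{a}_0$, so that $\bigwedge_{i \in I} \bar{a}_i$
and thus $\bigvee_{i \in I} a_i$ exists and the lemma is proved.
Referring to statement (A) of the second section, it is now clear that the
Boolean algebras isomorphic to $\mathfrak{M}$-fields of sets are exactly the
$\mathfrak{B}(\mathrm{II} - \mathfrak{M}, D)$ algebras and that, for each such algebra, the
associated $\mathfrak{M}$-field of sets may be taken as the field of open-and-closed
subsets of the subspace $D$ of $X(\mathfrak{B})$.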

Lemma 4.3 is a particular instance of a more general statement (3, Theorem
4.2 (3)) and returns us to the subject of P-spaces. From it one sees that for
each P-space $Y$ the Boolean algebra $\mathfrak{B}$ of all open-and-closed subsets of $Y$
is a $\mathfrak{B}(\mathrm{II} - \sigma, D)$ algebra with $\beta Y = X(\mathfrak{B})$ and
that the space $\upsilon Y$ may be identified with the subspace $D$ of the representation
space of this algebra. However, such $\mathfrak{B}(\mathrm{II} - \sigma, D)$ algebras
$\mathfrak{B}$ are still of a special character in that $\beta D = X(\mathfrak{B})$. This
may be cared for in the following way.

Henceforth, a P-Boolean algebra will be understood as any
$\mathfrak{B}(\mathrm{II} - \sigma, D)$ algebra $\mathfrak{B}$ in which the following
completeness condition obtains: every collection $\{a_i, i \in I\}$ of elements of
$\mathfrak{B}$ such that each prime dual ideal in $D$ either contains an element of that
collection or contains an element of $\mathfrak{B}$ disjoint from every element of the
collection has a least upper bound $\bigvee_{i \in I} a_i$ in $\mathfrak{B}$.

The significance of this completeness condition is explained in two steps.
Let $\mathfrak{F}(D)$ denote the field of open-and-closed subsets of the subspace $D$ of
the representation space $X(\mathfrak{B})$ of a $\mathfrak{B}(\mathrm{II} - \sigma, D)$
algebra $\mathfrak{B}$. As noted in reference to Lemma 4.3, $\mathfrak{F}(D)$ is a reduced,
$\sigma$-complete field of sets isomorphic to the algebra $\mathfrak{B}$. As the first
step, it is shown that $\mathfrak{F}(D)$ is union-intersection closed exactly when the
given $\mathfrak{B}(\mathrm{II} - \sigma, D)$ algebra satisfies the stated completeness
condition. Recall that elements $a$ of $\mathfrak{B}$ are in 1-1 order preserving
correspondence with elements $O$ of $\mathfrak{F}(D)$ through the relationship
$X(a) \cap D = O$. Then, for any subset
$A = \bigcup_{i \in I} O_i = \bigcap_{j \in J} O_j$ of $D$, the elements $a_i$ of
$\mathfrak{B}$ corresponding to the elements $O_i$ in $\bigcup_{i \in I} O_i$ are such that
each prime dual ideal of $D$ in $A$ contains one of the $a_i$, while each prime dual
ideal of $D$ in $D - A$ contains an element $b_j$ of $\mathfrak{B}$ disjoint from each of
the $a_i$, namely, an element $b_j$ of $\mathfrak{B}$ corresponding to the complement in
$D$ of some $O_j$ in $\bigcap_{j \in J} O_j$. Then, with $a = \bigvee_{i \in I} a_i$
existing in $\mathfrak{B}$, it is clear that $X(a) \cap D = A$, so that $\mathfrak{F}(D)$
is union-intersection closed. Conversely, if the set $\mathfrak{F}(D)$ is
union-intersection closed and $\{a_i, i \in I\}$ is a family of elements of $\mathfrak{B}$
such that each prime dual ideal in $D$ either contains an element $a_i$ of this family
or an element $b_j$ of $\mathfrak{B}$ disjoint from every member of the family, then,
with $A = \bigcup_{i \in I} [X(a_i) \cap D]$, one has
$D - A = \bigcup_{j \in J} [X(b_j) \cap D]$. Then
$A = \bigcup_{i \in I} [X(a_i) \cap D] = \bigcap_{j \in J} [X(\bar{b}_j) \cap D]$.
Finally, with $a_0$ in $\mathfrak{B}$ such that $X(a_0) \cap D = A$, it easily follows
that $a_0 = \bigvee_{i \in I} a_i$ in $\mathfrak{B}$, so that the completeness condition
follows.

As the second step, it is now shown that the demand that $\mathfrak{F}(D)$ be
union-intersection closed is equivalent to the demand that $\beta D = X(\mathfrak{B})$.
First assume that $\mathfrak{F}(D)$ is union-intersection closed. The space
$(D, \mathfrak{T})$ consisting of the set $D$ and the topology $\mathfrak{T}$ derived from
the field $\mathfrak{F}(D)$ is homeomorphic to the space $D$ as a subspace of
$X(\mathfrak{B})$. Hence $\beta(D, \mathfrak{T}) = \beta D$. However, $(D, \mathfrak{T})$
is a P-space so that $\beta(D, \mathfrak{T})$ is the representation space of the algebra of
all open-and-closed subsets of $(D, \mathfrak{T})$. With $\mathfrak{F}(D)$
union-intersection closed, this latter algebra is isomorphic to the algebra
$\mathfrak{F}(D)$ and thus to the given $\mathfrak{B}(\mathrm{II} - \sigma, D)$ algebra
$\mathfrak{B}$. Hence $\beta(D, \mathfrak{T}) = X(\mathfrak{B})$. Thus, if
$\mathfrak{F}(D)$ is union-intersection closed, then $\beta D = X(\mathfrak{B})$.
Conversely, if $\beta D = X(\mathfrak{B})$ so that each open-and-closed subset of $D$ in
its relative topology is of the form $X(a) \cap D$, then $\mathfrak{F}(D)$ is obviously
union-intersection closed.






















The preceding observations are now summarized. 


THEOREM 4.4. The class of all P-Boolean algebras is identical with the class
of all algebras of the open-and-closed subsets of the P-spaces. For any P-space $Y$,
the spaces $\beta Y$ and $\upsilon Y$ are homeomorphic to the spaces $X(\mathfrak{B})$ and
$D$ associated with the P-Boolean algebra of all open-and-closed subsets of $Y$. Two
P-spaces $Y$ and $Z$ correspond to the same P-Boolean algebra if, and only if,
$\beta Y = \beta Z$.


With P-spaces characterized as completely regular spaces in which countable
intersections of open sets are open, it seems proper to ask concerning completely
regular spaces in which any $\mathfrak{M}$-intersection of open sets is open,
$\mathfrak{M}$ being a cardinal number presumably larger than $\aleph_0 = \sigma$. Such
spaces may be referred to as P-$\mathfrak{M}$-spaces. Let a
$\mathfrak{B}(\mathrm{II} - \mathfrak{M}, D)$ algebra satisfying the additional
completeness condition cited above for P-Boolean algebras be called a
P-$\mathfrak{M}$-Boolean algebra. An exact analogue of Theorem 4.4 may then be stated
concerning the relationship of P-$\mathfrak{M}$-spaces and P-$\mathfrak{M}$-Boolean
algebras.


5. The P'-spaces. The P'-spaces form the second class of completely
regular spaces to be discussed here. Their characterization embodied a slight
weakening of that of the P-spaces. However, the most enlightening
characteristic of the P'-spaces is the following: for each element $f$ of $C(Y)$ and for
each point $p_0$ of $Z(f)$, if there is no neighbourhood $U$ of $p_0$ in $Y$ such that
$f(p) = 0$ throughout $U$, then there is a deleted neighbourhood $U'$ of $p_0$ such
that $f(p) > 0$ throughout $U'$ or $f(p) < 0$ throughout $U'$. It is this feature
of P'-spaces that guides the next procedures. Use is also made of the fact
(4, Theorem 8.4) that, for each P'-space $Y$, $\beta Y = X(\mathfrak{B})$ where
$\mathfrak{B}$ is a $\sigma$-complete Boolean algebra.

Let a point $p_0$ of an arbitrary completely regular space $Y$ be termed a
P'-point of $Y$ if it has the property cited just above.


LEMMA 5.1. Let $Y$ be a completely regular space such that $\beta Y = X(\mathfrak{B})$
where $\mathfrak{B}$ is a $\sigma$-complete Boolean algebra. Let each point $p$ of $Y$ as
in $X(\mathfrak{B})$ be considered as a prime dual ideal $\mathfrak{a}_p$ of
$\mathfrak{B}$. Then a point $\bar{p}$ of $Y$ is a P'-point of $Y$ if, and only if, the
corresponding prime dual ideal $\mathfrak{a}_{\bar{p}}$ satisfies the following condition:
for each countable union $1 = \bigvee a_n$ in $\mathfrak{B}$ of which no component element
$a_n$ is in $\mathfrak{a}_{\bar{p}}$, there exists a non-zero element $a_0$ of
$\mathfrak{B}$ with $a_0$ in $\mathfrak{a}_{\bar{p}}$ and such that all other
$\mathfrak{a}_p$ containing $a_0$ contain likewise some component of the given union.


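Proof. Assume first that $\bar{p}$ is a P'-point of $Y$. Let $1 = \bigvee a_n$ be a
disjoint countable union of elements of $\mathfrak{B}$ of which no component $a_n$ is in
$\mathfrak{a}_{\bar{p}}$. Then, because of the $\sigma$-completeness of $\mathfrak{B}$,
there exists an element $f$ of $C(X[\mathfrak{B}])$ with $f(\mathfrak{a}) = 1/n$ for each
prime dual ideal (point) $\mathfrak{a}$ containing $a_n$. Now let $a_0$ be any element of
$\mathfrak{B}$ in $\mathfrak{a}_{\bar{p}}$. Then $a_0 \wedge a_n \ne 0$ for at least one
element $a_n$ of the union $1 = \bigvee a_n$ and, since $\mathfrak{a}_{\bar{p}}$ contains
no element of this union, actually $a_0 \wedge a_n \ne 0$ for infinitely many subscripts
$n$. From this it follows that $f(\mathfrak{a}_{\bar{p}}) = 0$. However, with $\bar{p}$ a
P'-point of $Y$, there exists a deleted neighbourhood $U'$ of $\bar{p}$ in $Y$ and thus a
particular element $a_0$ of $\mathfrak{B}$ in $\mathfrak{a}_{\bar{p}}$ such that
$f(\mathfrak{a}_p) > 0$ for all $\mathfrak{a}_p$ containing $a_0$,
$\mathfrak{a}_p \ne \mathfrak{a}_{\bar{p}}$. However, $f(\mathfrak{a}_p) > 0$ means
$f(\mathfrak{a}_p) = 1/n$ for some $n$. This, in turn, is easily seen to mean that
$a_n \in \mathfrak{a}_p$. Thus there exists an element $a_0$ of $\mathfrak{B}$ in
$\mathfrak{a}_{\bar{p}}$ such that every $\mathfrak{a}_p$ containing $a_0$,
$\mathfrak{a}_p \ne \mathfrak{a}_{\bar{p}}$, contains likewise some element $a_n$ of the
given countable union.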

Conversely, assume that the prime dual ideal $\mathfrak{a}_{\bar{p}}$ of the
$\sigma$-complete algebra $\mathfrak{B}$ corresponding to the point $\bar{p}$ of $Y$ has
the property with respect to countable unions stated in the lemma. Let the element $f$ of
$C(Y)$ be such that $f(\bar{p}) = 0$. Assume, for the moment, that $f$ is non-negative
throughout $Y$. Let $f_0 = f \wedge 1$ in the usual sense of function lattices. Let
$\bar{f}_0$ or, for notational simplicity, simply $f$ denote the extension of $f_0$ over
$\beta Y = X(\mathfrak{B})$. Let
$O_n = [\mathfrak{a} \in X(\mathfrak{B}) \mid f(\mathfrak{a}) < 1/n]$. Then, by reason of
the $\sigma$-completeness of $\mathfrak{B}$, there exists an element $a_n$ of
$\mathfrak{B}$ such that $X(a_n) = \overline{O}_n$, the closure of $O_n$. The sequence
$\{a_n\}$ is obviously such that $a_{n+1} \le a_n$. Form the element
$a_0 = \bigwedge a_n$ in $\mathfrak{B}$. Finally, construct a new sequence $\{b_n\}$ in
$\mathfrak{B}$ with: $b_0 = a_0$, $b_1 = 1 \wedge \bar{a}_1$,
$b_2 = a_1 \wedge \bar{a}_2$, ....

Now $\bigvee_{n=0}^{\infty} b_n = 1$ and is a countable disjoint union. If some (non-zero)
$b_n$ is in $\mathfrak{a}_{\bar{p}}$, clearly this $b_n$ is $b_0 = a_0$ and one concludes
that $f(\mathfrak{a}_p) = 0$ for all $\mathfrak{a}_p$ with $a_0 \in \mathfrak{a}_p$. Then
$U = [p \in Y \mid a_0 \in \mathfrak{a}_p]$ is a neighbourhood of $\bar{p}$ in $Y$
such that $f(p) = 0$ throughout $U$. If no $b_n$ is in $\mathfrak{a}_{\bar{p}}$, then, by
hypothesis, there is an element $c_0$ of $\mathfrak{B}$ in $\mathfrak{a}_{\bar{p}}$ such
that every $\mathfrak{a}_p$ containing $c_0$,
$\mathfrak{a}_p \ne \mathfrak{a}_{\bar{p}}$, contains some (non-zero) $b_n$. Since $b_0$
is here assumed as not contained in $\mathfrak{a}_{\bar{p}}$, this first $c_0$ may be
replaced by $\bar{b}_0 \wedge c_0$. Denote this element also by the symbol $c_0$. Then
each $\mathfrak{a}_p$ containing $c_0$, $\mathfrak{a}_p \ne \mathfrak{a}_{\bar{p}}$,
contains also an element $b_n$ of the countable disjoint union and this $b_n$ is not the
element $b_0$. However, with $b_n = a_{n-1} \wedge \bar{a}_n$ in $\mathfrak{a}_p$,
$n > 1$, then $1/n \le f(\mathfrak{a}_p) \le 1/(n - 1)$ so that $f(\mathfrak{a}_p)$ is
non-zero. One concludes from this that
$U' = [p \in Y \mid p \ne \bar{p} \text{ and } c_0 \in \mathfrak{a}_p]$ is a deleted
neighbourhood of $\bar{p}$ in $Y$ such that $f(p) > 0$ throughout $U'$.

Finally, for an arbitrary element $f$ of $C(Y)$ with $f(\bar{p}) = 0$, first apply the
above analysis to the elements $f^+$, $f^-$ formed in the usual function-lattice
sense. Note that if $f^+(p) > 0$ throughout a deleted neighbourhood, then
$f^-(p) = 0$ throughout the same neighbourhood. With this in mind, this
converse part of the lemma is easily seen to hold for all elements $f$ of $C(Y)$
with $f(\bar{p}) = 0$.


THEOREM 5.2. Let $X(\mathfrak{B})$ be the Stone representation space of a
$\sigma$-complete Boolean algebra $\mathfrak{B}$. Let $Y$ be a subspace of
$X(\mathfrak{B})$ such that $\beta Y = X(\mathfrak{B})$ and also such that for every
countable union $\bigvee a_n = 1$ in $\mathfrak{B}$ each point (prime dual ideal)
$\mathfrak{a}_0$ of $Y$ either contains a component of this union or contains an element
$a_0$ of $\mathfrak{B}$ such that every other point $\mathfrak{a}$ of $Y$ which contains
$a_0$ contains an element of this union. Then $Y$ is a P'-space and every P'-space may be
thus described.


For the sake of brevity, a Boolean algebra of the type described in Theorem
5.2 will be called a P'-Boolean algebra. The description of such algebras is
very awkward. However, with $\mathfrak{B}$, $X(\mathfrak{B})$ and $Y$ as described in that
theorem, consider the field $\mathfrak{F}(Y)$ of open-and-closed subsets of $Y$.
Obviously $\mathfrak{F}(Y)$ is reduced and union-intersection closed. In view of the
$\sigma$-completeness of $\mathfrak{B}$, also $\mathfrak{F}(Y)$ is $\sigma$-complete in the
sense that every countable set of elements of $\mathfrak{F}(Y)$ is contained in a smallest
element of $\mathfrak{F}(Y)$. Finally, from Theorem 5.2, $\mathfrak{F}(Y)$ is seen to have
an additional property that may be called the near-$\sigma$-field property: if $O$ is the
smallest element of $\mathfrak{F}(Y)$ including each of the elements $\{O_n\}$ and if a
point $\bar{p}$ of $Y$ is in $O$ but in no $O_n$, then there exists an element $O_0$ of
$\mathfrak{F}(Y)$ with $\bar{p} \in O_0$ while $p \in O_0$, $p \ne \bar{p}$, implies
$p \in O_n$ for some $n$. Thus for any P'-Boolean algebra $\mathfrak{B}$ as described in
Theorem 5.2 the associated field $\mathfrak{F}(Y)$ is a reduced, union-intersection
closed, $\sigma$-complete, near-$\sigma$-field of sets which, as a Boolean algebra, is
isomorphic to $\mathfrak{B}$ while the space $(Y, \mathfrak{T})$ derived from
$\mathfrak{F}(Y)$ is homeomorphic to the P'-space $Y$. Note that
$\beta(Y, \mathfrak{T})$, as homeomorphic to $X(\mathfrak{B})$, is of dimension zero.


Conversely, let $\mathfrak{F}(Y)$ be a reduced, union-intersection closed,
$\sigma$-complete, near-$\sigma$-field of sets and let $(Y, \mathfrak{T})$ be formed as
usual. Then, by methods similar to those used in Theorem 3.1, it may be proved that
$(Y, \mathfrak{T})$ is a P'-space, provided one has assurance that
$\beta(Y, \mathfrak{T})$ is of dimension zero. Whether or not such assurance is contained
in the stated assumptions regarding $\mathfrak{F}(Y)$, the present writer does not know.
However, he has indicated elsewhere (6) how to state such assurance regarding
$\beta(Y, \mathfrak{T})$ in purely set-theoretic language.

These observations are now summarized. 


THEOREM 5.3. The P'-Boolean algebras are identical with the algebras formed
under the inclusion relation by elements of reduced, union-intersection closed,
$\sigma$-complete, near-$\sigma$-fields of sets $\mathfrak{F}(Y)$ with
$\beta(Y, \mathfrak{T})$ of dimension zero. Such fields, in turn, may be identified with
the fields of open-and-closed subsets of the P'-spaces.


6. The UF-Boolean algebras. We turn now to the U-spaces described
in (4). A completely regular space $X$ is a U-space if, and only if, to each
element $f$ of $C(X)$ there is associated a unit element $u$ in $C(X)$ such that
$f = u \cdot |f|$. For any completely regular space $X$, $X$ is a U-space if, and
only if, $\beta X$ is a U-space (4, Theorem 5.2). Finally, $\beta X$ is a U-space if, and
only if, it is zero-dimensional and for each element $f$ of $C(\beta X)$ the sets $P(f)$
and $N(f)$ are completely separated in $\beta X$, where $N(f)$ denotes the set of points
$p$ with $f(p) < 0$. The zero-dimensionality of such $\beta X$ links the U-spaces to
Boolean algebras.

Let $\mathfrak{B}$ again denote an arbitrary Boolean algebra. Let $p = \{a_n\}$ denote a
monotone, non-decreasing sequence of elements of $\mathfrak{B}$. For the sake of brevity,
refer to a sequence like $p$ as a tower in $\mathfrak{B}$. Two towers $p = \{a_n\}$ and
$r = \{b_n\}$ will be called disjoint if $a_n \wedge b_n = 0$ for each positive integer
$n$. Finally, an element $a_0$ of $\mathfrak{B}$ will be called a cap of a tower $p$ if
$a_n \le a_0$ for each element $a_n$ of $p = \{a_n\}$.

Now define a Boolean algebra $\mathfrak{B}$ to be a UF-Boolean algebra if, and only
if, disjoint towers in $\mathfrak{B}$ have disjoint caps in $\mathfrak{B}$. The
UF-Boolean algebras have a close relationship to the U-spaces (and F-spaces) of (3; 4).
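The definitions of tower, disjoint towers and cap can be made concrete with a minimal sketch, assuming the Boolean algebra is the algebra of all subsets of a set and that only a finite initial segment of each tower is inspected. The function names `is_tower`, `towers_disjoint` and `is_cap` are hypothetical and do not come from the paper.

```python
def is_tower(seq):
    """A tower is a monotone, non-decreasing sequence a_1 <= a_2 <= ..."""
    return all(a <= b for a, b in zip(seq, seq[1:]))


def towers_disjoint(p, r):
    """Towers p = {a_n}, r = {b_n} are disjoint if a_n meet b_n = 0 for each n."""
    return all(not (a & b) for a, b in zip(p, r))


def is_cap(a0, tower):
    """a0 caps the tower if a_n <= a0 for every element a_n of the tower."""
    return all(a <= a0 for a in tower)


# Finite initial segments of two towers in the power set of {1, ..., 6}.
p = [frozenset({1}), frozenset({1, 3}), frozenset({1, 3, 5})]
r = [frozenset({2}), frozenset({2, 4}), frozenset({2, 4, 6})]

cap_p = frozenset().union(*p)  # the union of a tower is its least cap
cap_r = frozenset().union(*r)

print(is_tower(p), is_tower(r), towers_disjoint(p, r))
print(is_cap(cap_p, p), is_cap(cap_r, r), not (cap_p & cap_r))
```

In a power set the union of each tower always exists and the two unions of disjoint towers are disjoint, so the UF condition holds trivially there; the condition has content only in algebras where such unions may fail to exist.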










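THEOREM 6.1. The UF-Boolean algebras are exactly those Boolean algebras
$\mathfrak{B}$ for which the sets $P(f)$ and $N(f)$ are completely separated in
$X(\mathfrak{B})$ for each element $f$ of $C[X(\mathfrak{B})]$.


Proof. Assume that $\mathfrak{B}$ is a UF-Boolean algebra and let $f$ be an element of
$C[X(\mathfrak{B})]$. Let
$F_n = [\mathfrak{a} \in X(\mathfrak{B}) \mid f(\mathfrak{a}) \ge 1/n]$ while
$O_n = [\mathfrak{a} \in X(\mathfrak{B}) \mid f(\mathfrak{a}) > 1/(n + \tfrac{1}{2})]$.
Then, using the compactness of $F_n$ and the openness of $O_n$, one can conclude to the
existence in $\mathfrak{B}$ of an element $a_n$ such that
$F_n \subset [\mathfrak{a} \in X(\mathfrak{B}) \mid a_n \in \mathfrak{a}] \subset O_n$.
Moreover, since $F_n \subset O_n \subset F_{n+1} \subset O_{n+1}$, one has
$a_n \le a_{n+1}$ and the sequence $p = \{a_n\}$ is a tower in $\mathfrak{B}$. Similarly,
with $F_n^* = [\mathfrak{a} \in X(\mathfrak{B}) \mid f(\mathfrak{a}) \le -1/n]$ and
$O_n^* = [\mathfrak{a} \in X(\mathfrak{B}) \mid f(\mathfrak{a}) < -1/(n + \tfrac{1}{2})]$,
let a second tower $r = \{b_n\}$ be constructed with
$F_n^* \subset [\mathfrak{a} \in X(\mathfrak{B}) \mid b_n \in \mathfrak{a}] \subset O_n^*$.
The two towers thus formed are clearly disjoint and thus, by assumption, have
disjoint caps $a_0$ and $b_0$. It is now but a small matter to verify that
$P(f) \subset [\mathfrak{a} \in X(\mathfrak{B}) \mid a_0 \in \mathfrak{a}]$ and
$N(f) \subset [\mathfrak{a} \in X(\mathfrak{B}) \mid b_0 \in \mathfrak{a}]$ so that the
sets $P(f)$ and $N(f)$ are completely separated in $X(\mathfrak{B})$.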

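Conversely, assume that for each element $f$ of $C[X(\mathfrak{B})]$ the sets $P(f)$ and
$N(f)$ are completely separated in $X(\mathfrak{B})$. Let $p = \{a_n\}$ and
$r = \{b_n\}$ be a pair of disjoint towers in $\mathfrak{B}$. Let $f_n$ be the unique
element of $C[X(\mathfrak{B})]$ with $f_n(\mathfrak{a}) = 1$ for all $\mathfrak{a}$ with
$a_n \in \mathfrak{a}$, with $f_n(\mathfrak{a}) = -1$ for all $\mathfrak{a}$ with
$b_n \in \mathfrak{a}$ and with $f_n(\mathfrak{a}) = 0$ for all $\mathfrak{a}$ containing
$\bar{a}_n \wedge \bar{b}_n$. Finally, form $f_0 = \sum_{n=1}^{\infty} f_n / 2^n$. Then
$f_0$ is an element of $C[X(\mathfrak{B})]$ and, by assumption, the sets $P(f_0)$ and
$N(f_0)$ are completely separated in $X(\mathfrak{B})$. In virtue of the
zero-dimensionality of $X(\mathfrak{B})$, this implies that there exists an element $a_0$
of $\mathfrak{B}$ such that
$P(f_0) \subset [\mathfrak{a} \in X(\mathfrak{B}) \mid a_0 \in \mathfrak{a}]$,
while $N(f_0) \subset [\mathfrak{a} \in X(\mathfrak{B}) \mid \bar{a}_0 \in \mathfrak{a}]$.
The element $a_0$ is now seen to cap the tower $p = \{a_n\}$ while its complement
$\bar{a}_0$ caps the tower $r = \{b_n\}$. Thus the theorem is proved.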


The observations of this section may now be summarized. 


THEOREM 6.2. Any UF-Boolean algebra is the algebra of all open-and-closed
subsets of some U-space and any such algebra is a UF-Boolean algebra. Two
U-spaces $Y$ and $Z$ correspond to the same UF-Boolean algebra if, and only
if, $\beta Y = \beta Z$.


7. Comments. This section begins with an observation concerning F-spaces
(4). A completely regular space $Y$ is an F-space if, and only if, for
each element $f$ of $C(Y)$ the sets $P(f)$ and $N(f)$ are completely separated. Every
F-space $Y$ has the following property (4, Theorem 2.6) pertinent to our
purpose: for each zero set $Z$ of $Y$ each element $f$ of $C^*(Y - Z)$ has a continuous
extension $\bar{f}$ in $C^*(Y)$. Here $C^*(Y)$ indicates the collection of bounded
elements of $C(Y)$.


LEMMA 7.1. Let $Y$ be a completely regular F-space. Then $\beta Y$ is without
$G_\delta$-points other than isolated points. Moreover, a point $p$ of $Y$ is a
non-isolated $G_\delta$-point in $Y$ if, and only if, every element $f$ of
$C^*(Y - \{p\})$ has a continuous extension at $p$ while some element of
$C(Y - \{p\})$ lacks such an extension.


Proof. As regards the first assertion, assume that $p$ is a $G_\delta$-point of
$\beta Y$. If $p$ is not an imbedded point of $Y$ in $\beta Y$, then every element of
$C^*(\beta Y - \{p\})$ has a continuous extension at $p$ by definition of $\beta Y$. If
$p$ is an imbedded point of $Y$ in $\beta Y$, then $\{p\}$ is a zero set in $Y$ and, by
the property of F-spaces cited above, one again concludes that every element of
$C^*(\beta Y - \{p\})$ has a continuous extension at $p$. Hence
$\beta(\beta Y - \{p\}) = \beta Y$ unless $p$ is an isolated point of $\beta Y$. However,
for any completely regular space $X$ the cardinality of a zero set contained in
$\beta X - X$ is at least $\exp(\exp \aleph_0)$ (7, Theorem 49).
Thus the point $p$ must be an isolated point in $\beta Y$.

As to the second assertion, it is merely to be noted that if a point $p$ of $Y$
has the extension properties listed in the lemma, then $\beta(Y - \{p\}) = \beta Y$
while $p \notin \upsilon(Y - \{p\})$. From this it follows easily that such a point is a
$G_\delta$-point (5, Example 2.3).

Now the F-spaces $X$ such that $\beta X$ is zero-dimensional and thus of present
interest are identical with the U-spaces (4, Theorem 5.5). With the U-spaces
described in terms of Boolean algebras, attention may now be called to the
following conclusion.


THEOREM 7.2. The Stone representation spaces of Boolean $\sigma$-algebras and,
more generally, of UF-Boolean algebras are without $G_\delta$-points other than isolated
points.


This theorem cannot be extended to include all Boolean algebras. In a 
written communication, C. W. Kohls called the attention of the writer to 
the following example. 


Example. Let $N$ denote the set of all positive integers. Let $\mathfrak{B}(N)$ denote
the class of all finite subsets of $N$ along with their complements in $N$ together
with the empty set and the set $N$ itself. As partially ordered by the inclusion
relation, $\mathfrak{B}(N)$ is a Boolean algebra. In $X(\mathfrak{B}[N])$ there is only
one prime dual ideal other than the point-principal dual ideals. That ideal consists of
all the infinite subsets of $N$ in $\mathfrak{B}(N)$. As a point of
$X(\mathfrak{B}[N])$ this ideal is obviously a non-isolated $G_\delta$-point. It is also
easily seen that $\mathfrak{B}(N)$ is not a UF-Boolean algebra. Thus let
$a_n = \{1, 3, \ldots, 2n - 1\}$ and $b_n = \{2, 4, \ldots, 2n\}$. Then, as
elements of $\mathfrak{B}(N)$, $a_n \le a_{n+1}$, $b_n \le b_{n+1}$ and
$a_n \wedge b_n = 0$. However, it is impossible to find in $\mathfrak{B}(N)$ elements
$a_0$, $b_0$ with $a_0 \wedge b_0 = 0$ and such that $a_n \le a_0$ and $b_n \le b_0$ for
all positive integers $n$.
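The reason no disjoint caps exist can be traced in a small model of $\mathfrak{B}(N)$. The sketch below is an illustration under assumptions, not the author's argument: each element is stored either as a finite set or as the complement of a finite set, and the class `FinCofin` and the helpers `a`, `b` and `witness_against_cap` are hypothetical names. A cap of the odd tower contains every odd integer, so it is infinite and therefore cofinite; a cofinite set omits only finitely many even integers and so must meet some $b_n$.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FinCofin:
    """Element of B(N): the finite set `members`, or its complement if `cofinite`."""
    members: frozenset
    cofinite: bool = False

    def meet(self, other):
        if not self.cofinite and not other.cofinite:
            return FinCofin(self.members & other.members)
        if self.cofinite and other.cofinite:
            # Intersection of complements is the complement of the union.
            return FinCofin(self.members | other.members, True)
        fin, cof = (self, other) if other.cofinite else (other, self)
        return FinCofin(fin.members - cof.members)

    def is_zero(self):
        return not self.cofinite and not self.members

    def leq(self, other):
        # x <= y exactly when x meet y equals x.
        return self.meet(other) == self


def a(n):
    """a_n = {1, 3, ..., 2n - 1}."""
    return FinCofin(frozenset(range(1, 2 * n, 2)))


def b(n):
    """b_n = {2, 4, ..., 2n}."""
    return FinCofin(frozenset(range(2, 2 * n + 1, 2)))


def witness_against_cap(excluded):
    """For the cofinite element a0 = N minus `excluded`, return n with a0 meet b_n != 0."""
    a0 = FinCofin(frozenset(excluded), True)
    n = max(excluded, default=0) + 1  # 2n exceeds every excluded integer
    assert not a0.meet(b(n)).is_zero()
    return n


print(a(3).meet(b(3)).is_zero())
print(witness_against_cap({2, 4, 6}))
```

Since any cap $b_0$ of the even tower lies above the $b_n$ returned by `witness_against_cap`, it meets the candidate $a_0$ as well, which is the failure of the UF condition asserted in the example.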













REFERENCES 


1. C. C. Chang, On the representation of $\alpha$-complete Boolean algebras, Trans. Amer. Math.
Soc., 85 (1957), 208-18.
2. S. Enomoto, Boolean algebras and fields of sets, Osaka Math. J., 5 (1953), 99-115.
3. L. Gillman and M. Henriksen, Concerning rings of continuous functions, Trans. Amer.
Math. Soc., 77 (1954), 340-62.
4. L. Gillman and M. Henriksen, Rings of continuous functions in which every finitely
generated ideal is principal, Trans. Amer. Math. Soc., 82 (1956), 362-91.
5. L. J. Heider, A note concerning completely regular $G_\delta$ spaces, Proc. Amer. Math. Soc., 8
(1957), 1060-6.
6. L. J. Heider, Compactifications of dimension zero, Notices Amer. Math. Soc., 552-9.
7. E. Hewitt, Rings of real-valued continuous functions, I, Trans. Amer. Math. Soc., 64 (1948),
45-99.
8. J. R. Isbell, Zero-dimensional spaces, Tohoku Math. J., 7 (1955), 1-8.
9. R. S. Pierce, Distributivity in Boolean algebras, Pacific J. Math., 7 (1957), 983-93.
10. R. Sikorski, On the representation of Boolean algebras as fields of sets, Fund. Math., 36
(1948), 247-58.
11. E. C. Smith, Jr., A distributivity condition for Boolean algebras, Ann. Math., 64 (1956),
551-61.
12. M. H. Stone, Boundedness properties in function lattices, Can. J. Math., 1 (1949), 176-86.










Institute for Advanced Study 
Marquette University 


























ON A PAPER OF MAURICE SION 
MARK MAHOWALD 


1. Let M, be the set of measures yu on the real line such that open sets are 
u*-measurable. While attempting to find out whether a set u*-measurable for 
all « in M, is mapped into a similar set by a continuous function of bounded 
variation, Maurice Sion develops a theory for what he calls variational 
measure (4). As an application of the theory, he gets conditions on a function 
f and a set of measures M in order that f map a set, which is u*-measurable for 
all » € M, into a set of the same kind. In particular he proves for his class 
M; (def. 2.5), the following theorem (4, § 8.11). 


THEOREM. If A is measurable for all measures in M, and if f is continuous 
from the irrationals to [0,1], then f(A) is measurable for all measures in M3. 


Since all projective sets are continuous images of the irrationals (2, p. 39) and 
since the existence of a non-measurable projective set is consistent with the 
axioms of set theory if they are consistent (1), Sion concludes that Lebesgue 
measure is not in $M_2$. 

We prove Sion's result in another way and, more importantly, we characterize 
$M_2$ completely with respect to open regular measures. As an application, we 
prove, without the continuum hypothesis, the existence of a function 
discontinuous on every set of positive outer measure (Lebesgue). 

The author is indebted to William Larkin for many helpful critical com- 
ments. 


2. Notation and definitions. 


2.1. A partition, $P(S)$, of a set $S$ is a collection of sets, $E \subset S$, finite in 
number, pairwise disjoint and whose union is $S$. 

2.2. A refinement of a partition, $P_1$, is a second partition, $P_2$, such that each 
set in $P_2$ is a subset of some set in $P_1$. 

2.3. An open regular measure is a measure such that each $\mu^*$-measurable set 
has a measurable cover which is a $G_\delta$ set. 

2.4. $M_0 = \{\mu : \mu$ is a measure on $[0, 1]$ and open sets are $\mu^*$-measurable$\}$. 


2.5. A sequence with property A is a sequence of partitions $P_n(S)$ such 
that 


(a) $S \subset [0, 1]$ and $0 < \mu^*(S) < \infty$; 
(b) $P_{n+1}$ is a refinement of $P_n$; 
(c) if $B \subset S$ and $\mu^*(B) > 0$, then 


Received October 17, 1958. 










$$\lim_{n \to \infty} \sum_{E \in P_n} \mu^*(B \cap E) = \infty.$$
2.6. $M_2 = \{\mu : \mu \in M_0$ and there does not exist a set $S$ with a partition 
sequence having property A$\}$. 
2.7. $M_3 = \{\mu : \mu \in M_0$, $\mu$ is open regular, there exists a partition sequence 
$P_n([0, 1])$ with property A$\}$. 
2.8. A measure will be called non-atomic if no single point has positive outer 
measure. 
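
A remark that may help in reading Definition 2.5 (it is not part of the original text and uses only the Carathéodory criterion for $\mu^*$-measurability): condition (c) cannot hold if every set of every partition $P_n$ is itself $\mu^*$-measurable. Under that assumption the sums in (c) do not depend on $n$, as the following short computation sketches.

```latex
% Assumption (for illustration): every E in P_n is \mu^*-measurable.
% For B \subset S, additivity of \mu^* over finitely many disjoint
% measurable sets gives, for every n,
\[
  \sum_{E \in P_n} \mu^*(B \cap E)
  \;=\; \mu^*\Bigl(B \cap \bigcup_{E \in P_n} E\Bigr)
  \;=\; \mu^*(B) \;\le\; \mu^*(S) \;<\; \infty ,
\]
% so the limit in 2.5(c) equals \mu^*(B) and is not \infty.
```

Hence a partition sequence with property A must contain sets that fail to be $\mu^*$-measurable.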


3. Conditions implying a measure is not in $M_2$ or is in $M_3$. 


3.1. THEOREM. If there exists a set $S \subset [0,1]$ of positive outer measure and a 
bounded function $f$ defined on $S$ discontinuous on every set $E \subset S$ for which 
$\mu^*(E) > 0$ and if $\mu$ is open regular, then $\mu \notin M_2$. 


The proof is long and will be given in § 5. 
3.2. COROLLARY. If $S = [0, 1]$ in the theorem, then $\mu \in M_3$. 


This corollary is an immediate consequence of the proof of the theorem 
(see § 5). 

3.3. The following lemma is obtained by a minor modification of the proof 
of the similar theorem (without the word "bounded") due to Sierpinski and 
Zygmund (3). 


LEMMA. There exists a bounded function from the reals to the reals which is 
discontinuous on every set having the power of the continuum. 


3.4. Then we can prove this 


THEOREM. If $\mu$ is such that every set of positive outer measure has the power 
of the continuum and $\mu$ is open regular, then $\mu$ is in $M_3$. 


Proof. The theorem follows immediately from 3.2 and 3.3. 


3.5. COROLLARY. If there exists one set of positive outer measure such that all 
subsets of positive outer measure have the power of the continuum and if $\mu$ is 
open regular for all subsets of this set, then $\mu \notin M_2$. 


3.6. COROLLARY. Under the continuum hypothesis: If $\mu$ is non-atomic and 
open regular, then $\mu$ is in $M_3$. If there exists a subset of positive outer measure 
such that every single point subset has measure zero, then $\mu \notin M_2$. 


Proof. If a measure is non-atomic then every countable set has measure 
zero. The continuum hypothesis then implies every set having positive outer 
measure has the power of the continuum and 3.4 and 3.5 prove the theorem. 

3.7. Every measure on the subsets of the unit interval is either in $M_3$ or it 
is not. The definition of $M_3$, which enables one to decide whether or not a 
measure is in $M_3$, does not depend on the continuum hypothesis, that is, the 




























definition makes sense if the hypothesis is true or false. Now, if there exists a 
non-atomic, open regular measure not in $M_3$, then this can be shown by a set 
theoretic argument. Such an argument with corollary 3.6 would be a proof 
from set theory of the proposition: the continuum hypothesis is false. Gödel 
(1) has shown that this cannot be proven with such an argument. Therefore, 
all open regular, non-atomic measures are in $M_3$, that is, we can improve 3.6 
to the following 


THEOREM. If $\mu$ is open regular and non-atomic, then $\mu \in M_3$. 
3.8. We can restate this as follows. 


THEOREM. There are no $\sigma$-finite open regular measures in $M_2$. Lebesgue 
measure is not in $M_2$ but it is in $M_3$. 


4. A converse to Theorem 3.1. 


4.1. THEOREM. If $\mu$ is an open regular measure not in $M_2$ and $S$ is a set with 
a partition sequence having property A, then there is a function defined on $S$ 
which is discontinuous on every subset $E$ of $S$ such that $\mu^*(E) > 0$. 


Proof. For $\mu$, there exists a sequence $P_n(S)$ of partitions with property A. 
Let $F_{11}, \dots, F_{k1}$ be a numbering of the sets of $P_1$. Let $n_1$ be the smallest 
integer larger than $\log_2 k$. Define 

$$f_1(x) = (i - 1)/2^{n_1}$$
for $x \in F_{i1}$, for $i = 1, \dots, k$. 

Suppose for $m - 1$ we have defined $n_{m-1}$, a numbering, $F_{i,m-1}$, for the 
partition $P_{m-1}$, and $f_{m-1}$. The induction step will be defined as follows: 

Let $p_m = \max_q j_q$, where $j_q$ is the number of sets in $P_m$ which are subsets of 
$F_{q,m-1}$. Let $h_m$ be the smallest integer greater than $\log_2(p_m + 2)$. Let 
$n_m = h_m + n_{m-1}$. Let $F_{im}$, for 

$$i = (q-1)2^{h_m} + 1, \dots, (q-1)2^{h_m} + j_q,$$
and 
$$q = 1, \dots, 2^{n_{m-1}},$$
be a numbering of the sets of $P_m$ which are subsets of $F_{q,m-1}$. If $F_{im}$ does not 
appear in this numbering, then $F_{im} = \emptyset$. Then define 
$$f_m(x) = (i - 1)/2^{n_m} \quad \text{for } x \in F_{im}.$$
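
The inductive numbering can be followed on a small hypothetical instance. The sketch below is an illustration only: the sizes ($k = 3$ sets in $P_1$, splitting into $2$, $1$ and $2$ sets in $P_2$) are assumed for the example and do not come from the paper, and the index ranges follow one reading of the construction just described.

```latex
% Hypothetical instance: P_1 = \{F_{11}, F_{21}, F_{31}\}, so k = 3.
% n_1 = smallest integer larger than \log_2 3 \approx 1.585, i.e. n_1 = 2.
\[
  f_1 = 0 \text{ on } F_{11}, \qquad
  f_1 = \tfrac14 \text{ on } F_{21}, \qquad
  f_1 = \tfrac24 \text{ on } F_{31}.
\]
% Assume F_{11}, F_{21}, F_{31} split into j_1 = 2, j_2 = 1, j_3 = 2 sets of P_2.
% Then p_2 = 2, h_2 = smallest integer greater than \log_2(2 + 2) = 2, i.e. h_2 = 3,
% and n_2 = h_2 + n_1 = 5.  Block q uses the indices (q-1)8 + 1, ..., (q-1)8 + j_q:
\[
\begin{array}{c|c|c}
q & \text{indices } i & f_2 = (i-1)/2^{5} \\ \hline
1 & 1,\ 2   & 0,\ \tfrac{1}{32} \\
2 & 9       & \tfrac{8}{32} = \tfrac14 \\
3 & 17,\ 18 & \tfrac{16}{32} = \tfrac12,\ \tfrac{17}{32}
\end{array}
\]
% In each block the last index i = 8q is unused, so the value (8q - 1)/32
% is never taken by f_2; also f_1 \le f_2 \le f_1 + 1/32 in this instance.
```

The unused indices at the end of each block are what keep the values of the limit function away from the right end points of the dyadic intervals.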


The sequence $f_m$ is monotonically non-decreasing and is uniformly bounded 
by one. Therefore there exists a limit function $f_0$. 
For $m$ fixed, our choice of $h_m$ assures us that 


$$\{x : f_m(x) = (q2^{h_m} - 1)/2^{n_m}\} = \emptyset$$


since it would equal 


$$F_{q2^{h_m},\,m}$$










which is empty. As a consequence, we have: if 
$$f_0(x) < (q2^{h_m})/2^{n_m}$$
for any $m$ and $q$, then 
$$f_0(x) < (q2^{h_m} - 1)/2^{n_m}.$$
Therefore 
$$F_{im} = \{x : [i - 1 - (1/2^{n_{m+1}})]/2^{n_m} < f_0(x) < i/2^{n_m}\}.$$


Now suppose there exists a set $B \subset S$ such that $\mu^*(B) > 0$ and such that 
$f_0$ is continuous on $B$. Since $F_{im}$ is the inverse image of an open set, $F_{im} \cap B$ is 
open in $B$, that is, there exists an open set $U_{im}$ such that $F_{im} \cap B = U_{im} \cap B$. 
Let $U_{ijm} = U_{im} \cap U_{jm}$ for $i \ne j$ and $U_{iim} = \emptyset$ and let $V_{im} = U_{im} - \bigcup_j U_{ijm}$. 
Clearly $V$ is pairwise disjoint for $m$ fixed. Also, since $F$ is pairwise disjoint 
for $m$ fixed, no point of $F_{im} \cap B$ can be in $U_{jm}$ for $i \ne j$. Therefore, we 
have $V_{im} \cap B = F_{im} \cap B$. Hence we can choose a measurable cover of 
$F_{im} \cap B$, $C_{im}$, which is a subset of $V_{im}$. Therefore, $C$ is pairwise disjoint and 


$$\sum_i \mu^*(F_{im} \cap B) = \sum_i \mu(C_{im}) = \mu\Bigl(\bigcup_i C_{im}\Bigr) = \mu^*(B).$$
Since $m$ is arbitrary, we have 
$$\lim_{m \to \infty} \sum_i \mu^*(F_{im} \cap B) = \mu^*(B)$$
but this is just 
$$\lim_{m \to \infty} \sum_{E \in P_m} \mu^*(B \cap E) = \mu^*(B) \le \mu^*(S) < \infty.$$


This contradiction proves the theorem. 


4.2. COROLLARY. For every $\mu \in M_3$, there exists a function discontinuous on 
every set having positive outer measure. In particular there exists such a function 
for Lebesgue measure. 


5. Proof of Theorem 3.1. Let $f$ be the function described in the theorem. 
Since $f$ is bounded we can suppose that $0 \le f(x) \le 1$. Let $E_{ni} = \{x : i/2^n \le 
f(x) < (i + 1)/2^n\}$, $i = 0, \dots, 2^n$. Set $P_n(S) = \{E_{ni}\}$; we shall prove that 
this sequence has the property A. The facts that $P_n$ is a partition and that 
$P_{n+1}$ is a refinement of $P_n$ are clear. We need only show that, for any $B \subset S$ 
for which $\mu^*(B) > 0$, we have 


$$\lim_{n \to \infty} \sum_{i=0}^{2^n} \mu^*(E_{ni} \cap B) = \infty.$$
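
As a concrete illustration of the dyadic sets $E_{ni}$ (an added sketch; the sample value $f(x) = 0.3$ is hypothetical), one can track which set contains a given point at successive levels and check the refinement relation directly.

```latex
% Illustration only; f(x) = 0.3 is a hypothetical sample value.
% With E_{ni} = \{x : i/2^n \le f(x) < (i+1)/2^n\}, the index of the set
% containing x is i = \lfloor 2^n f(x) \rfloor:
\[
\begin{array}{c|c|c}
n & i = \lfloor 2^n \cdot 0.3 \rfloor & [\,i/2^n,\ (i+1)/2^n) \\ \hline
1 & 0 & [0,\ 1/2) \\
2 & 1 & [1/4,\ 1/2) \\
3 & 2 & [1/4,\ 3/8)
\end{array}
\]
% Refinement: each level-n set is the union of two level-(n+1) sets,
\[
  E_{ni} = E_{n+1,\,2i} \cup E_{n+1,\,2i+1}, \qquad 0 \le i \le 2^n - 1,
\]
% and, since 0 \le f \le 1, E_{n,2^n} = \{x : f(x) = 1\} = E_{n+1,\,2^{n+1}}.
```

The nested intervals shrink to the value $f(x)$, which is why two points lying in the same $E_{ni}$ have values of $f$ differing by less than $2^{-n}$.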
Assume that there exists a set $E \subset S$ such that 
$$(1)\qquad 0 < \lim_{n \to \infty} \sum_i \mu^*(E_{ni} \cap E) = a < \infty.$$






























We then shall prove that there exists a subset of E having positive outer 
measure and on which f is continuous and this contradiction will prove the 
theorem. 


Subadditivity of $\mu^*$ implies that the limit in (1) approaches $a$ from below. 
Therefore, there exists an $N$ such that $n \ge N$ implies, for $a/10 > \varepsilon > 0$, 


$$a - \varepsilon < \sum_i \mu^*(E_{ni} \cap E) \le a$$


and in particular 


$$a - \varepsilon < \sum_i \mu^*(E_{Ni} \cap E) \le a.$$


Let $B_{Nj}$ be a measurable cover of $E \cap E_{Nj}$. If $E_{ni}$ is a subset of $E_{Nj}$, we 
shall write $E_{nij}$ and we shall designate a measurable cover of $E \cap E_{nij}$ by 
$B_{nij}$. It is easily shown that the sets $B_{nij}$, $n \ge N$, can be so determined that 
if $I$ is a set of integers such that $\bigcup_{i \in I} E_{mij} = E_{nkj}$, then $\bigcup_{i \in I} B_{mij} = B_{nkj}$. 

We next derive measurable sets $H_{nij}$ contained in $B_{nij}$, disjoint for each 
fixed pair $n$, $j$ and with 


$$a - 2\varepsilon < \sum_{j,i} \mu(H_{nij}) \le a.$$
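Let 
$$(2)\qquad B_{nikj} = \begin{cases} B_{nij} \cap B_{nkj}, & i \ne k, \\ \emptyset, & i = k, \end{cases}$$
and let $H_{nij} = B_{nij} - \bigcup_k B_{nikj}$. Since, for $n \ge N$, 
$$a - \varepsilon < \sum_j \mu(B_{Nj}) = \sum_j \mu\Bigl(\bigcup_i B_{nij}\Bigr) \le a$$
and 
$$a - \varepsilon < \sum_{j,i} \mu(B_{nij}) \le a,$$
we have 
$$\sum_j \Bigl[\sum_i \mu(B_{nij}) - \mu\Bigl(\bigcup_i B_{nij}\Bigr)\Bigr] = \sum_{j,i} \mu\Bigl(\bigcup_k B_{nikj}\Bigr) < \varepsilon.$$
From this and the definition of $H$, we have 
$$(3)\qquad a - 2\varepsilon < \sum_{j,i} \mu(H_{nij}) \le a$$
for all $n \ge N$. By the choice of $B$, $\bigcup_i H_{nij}$ is monotonically decreasing as a 
function of $n$ for each $j$. Letting $H_j = \bigcap_n \bigcup_i H_{nij}$, we have from (3) 


$$(4)\qquad \sum_j \mu(H_j) \ge a - 2\varepsilon.$$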


We next obtain formulas analogous to (3) and (4) with the sets $H_{nij}$ 
replaced by open sets. Let $V_{nij}$ be an open cover of $H_{nij}$ such that 


$$\mu(V_{nij}) \le \mu(H_{nij}) + \varepsilon/2^{n}(2^{n} + 1)$$













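and if $I$ is a set of integers such that $\bigcup_{i \in I} E_{mij} = E_{nkj}$, then $\bigcup_{i \in I} V_{mij} \subset V_{nkj}$. 
Such a cover exists because of the open regularity of $\mu$. Then for every $n \ge N$ 
we have 


$$a - 2\varepsilon < \sum_{i,j} \mu(H_{nij}) = \sum_j \mu\Bigl(\bigcup_i H_{nij}\Bigr) \le \sum_j \mu\Bigl(\bigcup_i V_{nij}\Bigr) \le \sum_{i,j} \mu(V_{nij}) < \sum_{i,j} \mu(H_{nij}) + \varepsilon \le a + \varepsilon.$$


Therefore 
$$\sum_{i,j} \mu(V_{nij}) - \sum_j \mu\Bigl(\bigcup_i V_{nij}\Bigr) < 3\varepsilon.$$
Using notation analogous to that of (2), letting $U_{nij} = V_{nij} - \bigcup_k V_{nikj}$, and 
using the same argument which leads to (3) and (4), we have 
$$(5)\qquad \sum_{j,i} \mu\Bigl(\bigcup_k V_{nikj}\Bigr) < 3\varepsilon \quad\text{and}\quad a - 5\varepsilon < \sum_{j,i} \mu(U_{nij}) \le a + \varepsilon$$
for all $n \ge N$. 
By the choice of $V$, $\bigcup_i U_{nij}$ is monotonically decreasing as a function of $n$ 
for each $j$. Letting $U_j = \bigcap_n \bigcup_i U_{nij}$, we have from (5) 
$$(6)\qquad \lim_{n \to \infty} \sum_{j,i} \mu(U_{nij}) = \sum_j \lim_{n \to \infty} \mu\Bigl(\bigcup_i U_{nij}\Bigr) = \sum_j \mu(U_j) \ge a - 5\varepsilon.$$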


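We shall now show 


$$(7)\qquad \sum_j \mu(U_j \cap H_j) \ge a - 8\varepsilon.$$


Since 


$$\mu(V_{Nj}) = \mu(V_{Nj} - (U_j \cup H_j)) + \mu(H_j \cup U_j)$$


and 
$$\mu(H_j \cup U_j) = \mu(H_j) + \mu(U_j) - \mu(H_j \cap U_j),$$
we have 
$$\sum_j \bigl(\mu(H_j) + \mu(U_j) - \mu(H_j \cap U_j)\bigr) \le \sum_j \mu(V_{Nj}) \le a + \varepsilon$$
or 


$$a - 2\varepsilon + a - 5\varepsilon - \sum_j \mu(H_j \cap U_j) \le a + \varepsilon.$$


This yields (7). 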
Pick a $j$ such that $\mu(U_j \cap H_j) > 0$. Then 


$$\mu^*(E \cap E_{Nj} \cap U_j) \ge \mu(U_j \cap H_j) > 0.$$
Let $C = E \cap E_{Nj} \cap U_j$. Then for arbitrary but fixed $n \ge N$ we have 
$$V_{nij} \cap V_{nkj} \cap C = \emptyset \quad \text{for } i \ne k.$$


We shall show that $f$ is continuous on $C$. Let $\delta > 0$ be given. Then there 
exists an $n$ such that $2^{-n} < \delta$. Let $x_0$ be in $C$. Then 


$$|f(x_0) - f(x)| < 2^{-n} < \delta$$










for all $x$ in $V_{nij} \cap C$ where $i$ is such that $x_0$ is in $E_{ni}$. Therefore $f$ is 
continuous on $C$ contrary to hypothesis on $f$. This contradiction proves the 
theorem. 


REFERENCES 


1. K. Gödel, The consistency of the axiom of choice and of the generalized continuum-hypothesis, 
Proc. Nat. Acad. Sci. U.S.A., 24 (1938), 556-7. 

2. W. Sierpinski, Les ensembles projectifs et analytiques, Mémorial des Sciences Mathématiques, 
no. 112 (1950). 

3. W. Sierpinski and A. Zygmund, Sur une fonction qui est discontinue sur tout ensemble de 
puissance du continu, Fund. Math., 4 (1923), 316-18. 

4. Maurice Sion, Variational Measure, Trans. Amer. Math. Soc., 83 (1956), 205-21. 


Xavier University 











ON GENERALIZED MORSE-TRANSUE FUNCTION 
SPACES 


H. W. ELLIS 


1. Introduction. Marston Morse and William Transue (6, 8) have 
introduced and studied function spaces, called MT-spaces, for which the 
elements of the topological dual are of integral type. Their theory does not 
admit certain classical Banach function spaces including spaces of bounded 
functions and $\mathfrak{L}_C^{\infty}$ spaces. The theory of function spaces determined by a 
length function ($\lambda$-spaces) (4, 5), which depends on a fixed measure, admits 
many of the maximal MT-spaces, the spaces $\mathfrak{L}_C^{\infty}$ and spaces of locally 
integrable functions but does not admit certain maximal MT-spaces including 
the space $\mathfrak{K}_C$ of complex continuous functions with compact supports. 

In (4) the definition of MT-spaces was weakened by dropping the 
requirement that $\mathfrak{K}_C$ be dense in the space and making no hypothesis concerning 
the dual. The resulting spaces were called MT*-spaces and the elements of 
integral type in the dual then constituted the MT-conjugate of the space. A 
$\lambda$-space (4) is an MT*-space if it contains $\mathfrak{K}_C$. The MT-spaces are just those 
MT*-spaces for which the dual and MT-conjugate coincide. The space of 
bounded functions on a suitable space $E$ is an MT*-space that is neither an 
MT- nor a $\lambda$-space. 

In the development of the theory of MT-spaces an important role was 
played by the fact that the semi-norm $\mathfrak{N}^A$ could be defined in $A$ and extended 
to all of $C^E$ by (3.2) below. Since there are MT*-spaces for which the 
MT-conjugate reduces to the zero element of the dual (§ 3), (3.2) is not valid for 
every MT*-space. For an $\mathfrak{N}^A$-extensible MT*-space (Definition 3.2) (3.2) 
holds. Since $\mathfrak{N}^A$ is then a reflexive semi-norm, the MT-conjugate is then 
dense in the dual of $A$ in the $\sigma(A', A)$ topology (Theorem 3.1). The 
$\mathfrak{N}^A$-extensible MT*-spaces have many of the properties of general MT-spaces. 

The last part of this paper is mainly concerned with the role played in 
the general theory of MT*-spaces by the $\lambda$-spaces. When $E$ is countable at 
infinity this can be simply stated as follows. If $A$ is a $\lambda$-space containing 
$\mathfrak{K}_C$, $A$ is an $\mathfrak{N}^A$-extensible MT*-space for which every measure in $\mathfrak{A}^*$ is of 
base $\mu$ (Theorem 3.3). Conversely if $A$ is an $\mathfrak{N}^A$-extensible MT*-space for 
which every measure in $\mathfrak{A}^*$ is of base $\mu$, $\mathfrak{N}^A$ extended by (3.2) determines 
a length function $\lambda$ (Theorem 4.1) and $\mathfrak{L}_C^{\lambda}$, the $\lambda$-space determined by $\lambda$, and 
$\Omega^A$ (§ 3), coincide on some $\mu$-measurable set $B$ with $E - B$ $\mathfrak{A}^*$-negligible 
(Theorem 4.3). If then $A$ is an MT*-space of Cauchy type, $A = \mathfrak{L}_C^{\lambda} = \Omega^A$ 


Received October 6, 1958. 


on $B$. Thus an MT-space of Cauchy type on a locally compact space $E$ that 
is countable at infinity coincides with a $\lambda$-space for $\mu$ on the restriction of 
$E$ to some $\mu$-measurable set $B$ with $E - B$ $\mathfrak{A}^*$-negligible if and only if every 
element of $\mathfrak{A}'$ is of base $\mu$. 


2. The MT-conjugates as vector spaces. Let $E$ be a locally compact 
space, $C^E$ the vector space of functions on $E$ valued in $C$ the field of complex 
numbers. A semi-norm $\mathfrak{N}^A$ on a vector subspace $A$ of $C^E$ will be called monotone 
if $\mathfrak{N}^A(x) \le \mathfrak{N}^A(y)$ when $|x(t)| \le |y(t)|$, $x, y \in A$; non-trivial if $\mathfrak{N}^A(x) \not\equiv 0$ 
over $A$ (6). 


Definition. A vector subspace $A$ of $C^E$ will be called an MT*-space if it 
contains $\mathfrak{K}_C$, if with $x$ it contains $|x|$ and $\bar{x}$ and if it has a non-trivial, monotone 
semi-norm $\mathfrak{N}^A$. 

If $A'$ is the dual of $A$ topologized by $\mathfrak{N}^A$ as a semi-norm and if $y \in A'$, 
then the restriction of $y$ to $\mathfrak{K}_C$ determines a C-measure $\eta$ and 


$$(2.1)\qquad y(x) = \int x \, d\eta,$$


for every $x \in \mathfrak{K}_C$ (6). We denote by $A^*$ the subspace of elements $y$ of $A'$ 
for which every $x \in A$ is $\eta$-integrable with (2.1) holding and call such a $y$ 
an element of integral type. We call $A^*$ the MT-conjugate of $A$. As in (6) 
the mapping $y \to \eta$ of $A^*$ into $\mathfrak{M}_C$, the space of measures on $E$, is an 
isomorphism. We denote by $\mathfrak{A}'$ and $\mathfrak{A}^*$ the images of $A'$, $A^*$ in $\mathfrak{M}_C$ and call 
$\mathfrak{A}^*$ the MT-measure conjugate of $A$. We define for each $y \in A^*$, $\eta \in \mathfrak{A}^*$, 


$$|\eta|_{\mathfrak{A}^*} = \sup_{\substack{x \ne 0 \\ x \in A}} \Bigl|\int x \, d\eta\Bigr| \Big/ \mathfrak{N}^A(x) = \sup_{\substack{x \ne 0 \\ x \in A}} |y(x)|/\mathfrak{N}^A(x) = |y|_{A^*} = |y|_{A'},$$


where $|y|_{A'}$ is the usual norm on $A'$. There are corresponding definitions for 
real MT*-spaces. 

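$A^*$ is a vector subspace of $A'$. Let $y_1, y_2 \in A^*$, $a, b \in C$. Then $z = ay_1 + 
by_2 \in A'$ and determines a C-measure $\zeta$. From (2.1) for $\mathfrak{K}_C$ it follows that 
$\zeta = a\eta_1 + b\eta_2$. By (6, Corollary 9.1) every $x \in A$ is $a\eta_1 + b\eta_2 = \zeta$-integrable and 


$$\int x \, d\zeta = \int x \, d(a\eta_1 + b\eta_2) = ay_1(x) + by_2(x) = z(x).$$


The spaces $A^*$ and $\mathfrak{A}^*$ are thus normed vector spaces, equivalent by definition. 
Morse and Transue (6, p. 153) associate with each C-measure $\eta$ on $E$ a 
unique positive measure $|\eta|$ such that for $x \in \mathfrak{K}$, $x \ge 0$, 


$$(2.2)\qquad |\eta|(x) = \sup_{\substack{|u| \le x \\ u \in \mathfrak{K}_C}} \Bigl|\int u \, d\eta\Bigr|.$$


The absolute measure $|\eta|$ defined by $\eta$ then has a unique extension $|\eta|_e$ as a 
real C-measure on $E$ (6, p. 151). 


Condition 2.1. If $\eta \in \mathfrak{A}^*$, $|\eta|_e \in \mathfrak{A}^*$ and $|\eta|_{\mathfrak{A}^*} = \bigl|\,|\eta|_e\,\bigr|_{\mathfrak{A}^*}$. 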













Condition (2.1) is the analogue for the MT-conjugate spaces of the 
condition for $A$ that $|x| \in A$ if $x \in A$ (noting that the monotone property of 
$\mathfrak{N}^A$ implies that $\mathfrak{N}^A(x) = \mathfrak{N}^A(|x|)$). If, for a positive measure $\mu$, the C-measure 
$\eta$ is of base $\mu$ (that is, can be written in the form $g(t).\mu$ with $g(t)$ locally 
$\mu$-integrable) (3, p. 42; 7, § 3), 


$$(2.3)\qquad |g(t).\mu| = |g(t)|.\mu.$$


When all the elements of $\mathfrak{A}^*$ are of base $\mu$, $A^*$ can be identified with the 
collection of functions $\{g(t)\}$. If then $A^*$ is an MT*-space Condition 2.1 is 
necessarily satisfied. We note also that it is trivially satisfied when $A^* = 0$, 
that it is satisfied by the measure dual of every MT-space (6, Lemma 11.2) 
and by the measure dual of every MT*-space that is a $\lambda$-space with the 
MT- and $\lambda$-conjugates coinciding (4). 

Suppose that $a_i$, $i = 1, 2, \dots$, are positive measures with $a_{i,e} \in \mathfrak{A}^*$ and 
that $\sum_1^{\infty} |a_{i,e}|_{\mathfrak{A}^*} < \infty$. Then for every $x \in \mathfrak{K}$, $x \ge 0$, 

$$\sum_1^{\infty} |a_i(x)| = \sum_1^{\infty} a_i(|x|) \le \mathfrak{N}^A(x) \sum_1^{\infty} |a_{i,e}|_{\mathfrak{A}^*} < \infty$$


so that the $a_i$ form a summable family of positive measures on $E$ and determine 
a positive measure $a_0 = \sum_1^{\infty} a_i$ (3, § 3, no. 5). 


THEOREM 2.1. Let $A$ be an MT*-space for which Condition 2.1 holds. If every 
real $x$ in $A$ is $a_0$-integrable for every $a_0$ defined as in the preceding paragraph, 
then $A^*$ is complete. 


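Proof. The theorem is trivial when $\mathfrak{A}^* = 0$. In the general case let $\{\eta_n\}$ 
denote a Cauchy sequence in $\mathfrak{A}^*$ and choose a subsequence $\{\eta_{n_i}\}$ with 


$$|\eta_{n_1}|_{\mathfrak{A}^*} + \sum_1^{\infty} |\eta_{n_{i+1}} - \eta_{n_i}|_{\mathfrak{A}^*} = L < \infty.$$
Define 
$$a_1 = |\eta_{n_1}|, \quad a_i = |\eta_{n_i} - \eta_{n_{i-1}}|, \ i = 2, 3, \dots, \quad a_0 = \sum_1^{\infty} a_i.$$
Condition 2.1 implies that each $a_{i,e}$ is in $\mathfrak{A}^*$ with 
$$|a_{1,e}|_{\mathfrak{A}^*} = |\eta_{n_1}|_{\mathfrak{A}^*},$$
$$|a_{i+1,e}|_{\mathfrak{A}^*} = |\eta_{n_{i+1}} - \eta_{n_i}|_{\mathfrak{A}^*}, \quad i = 1, 2, \dots.$$


By hypothesis each real $x \in A$ is $a_0$-integrable so that (3, Proposition 5, 3°) 


$$\int x \, da_0 = \sum_1^{\infty} \int x \, da_i.$$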
1 


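If $x \in A$, $x = x_1 + ix_2$, with $x_1$ and $x_2$ real and in $A$, $x$ is $a_{0,e}$-integrable 
(6, Lemma 4.3) and 


$$\int x \, da_{0,e} = \int x_1 \, da_0 + i \int x_2 \, da_0 = \sum_1^{\infty} \int x_1 \, da_i + i \sum_1^{\infty} \int x_2 \, da_i = \sum_1^{\infty} \int x \, da_{i,e},$$
$$\Bigl|\int x \, da_{0,e}\Bigr| \le \sum_1^{\infty} \int |x| \, da_i \le L\,\mathfrak{N}^A(x).$$


It follows that $\int x \, da_{0,e}$ determines a continuous linear functional $y$ of integral 
type with $\eta = a_{0,e}$ and therefore $a_{0,e} \in \mathfrak{A}^*$. 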
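For each $x \in A$, 


$$\sum_{i=1}^{p} [\eta_{n_{i+1}}(x) - \eta_{n_i}(x)]$$
is a Cauchy sequence in $C$ since 


$$\Bigl|\sum_{i=p}^{q} [\eta_{n_{i+1}}(x) - \eta_{n_i}(x)]\Bigr| \le \sum_{i=p+1}^{q+1} a_i(|x|) \to 0$$


as $p, q \to \infty$. Thus 


$$(2.4)\qquad \eta(x) = \eta_{n_1}(x) + \sum_1^{\infty} [\eta_{n_{i+1}}(x) - \eta_{n_i}(x)] = \lim \eta_{n_i}(x)$$


is defined in $C$ for every $x \in A$. Now $\eta$ is linear on $A$ and continuous since 
$$(2.5)\qquad |\eta(x)| \le a_0(|x|) \le L\,\mathfrak{N}^A(x)$$


for all $x \in A$. Thus $\eta$ determines an element of $A'$. 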

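It follows from (2.2) and (2.5) that $|\eta|(x) \le a_0(x)$ for every $x \ge 0$, $x \in \mathfrak{K}$. 
This implies that $|\eta|^*(x) \le a_0^*(x)$ for every $x \ge 0$. Thus every $a_0$-negligible 
set is $|\eta|$-negligible and every $a_0$-measurable function is $|\eta|$-measurable (2, 
p. 180). Thus if $x \in A$, $|x|$ is $|\eta|$-measurable and $x$ is $\eta$-measurable (6, p. 168). 
Since 

$$\int |x| \, d|\eta| \le \int |x| \, da_0 \le L\,\mathfrak{N}^A(x) < \infty,$$


every $x$ in $A$ is $\eta$-integrable (6, Theorem 9.4). This with (2.5) shows that 
$\int x \, d\eta$ determines an element $y \in A^*$ with $y \to \eta$ so that $\eta \in \mathfrak{A}^*$. 
Then 


$$|\eta - \eta_{n_i}|_{\mathfrak{A}^*} = \sup_{\substack{x \ne 0 \\ x \in A}} \Bigl|\int x \, d(\eta - \eta_{n_i})\Bigr| \Big/ \mathfrak{N}^A(x) \le \sup_{\substack{x \ne 0 \\ x \in A}} \sum_{j=i+1}^{\infty} \int |x| \, da_j \Big/ \mathfrak{N}^A(x) \le \sum_{j=i+1}^{\infty} |a_{j,e}|_{\mathfrak{A}^*}$$


which approaches zero as $i \to \infty$. The full sequence $\{\eta_n\}$ then converges to 
$\eta$ in $\mathfrak{A}^*$ so that $A^*$ is complete. 


COROLLARY. If $E$ is countable at infinity and $A$ is an MT*-space for which 
Condition 2.1 holds, $A^*$ and $\mathfrak{A}^*$ are Banach spaces. 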










Proof. By (3, Corollaire 2, p. 28) every $x \in A$ is $a_0$-integrable. 


Length functions for a positive measure $\mu$ are defined in (4, 5). We denote 
by $\mathfrak{L}^{\lambda}$, $\mathfrak{L}_C^{\lambda}$ the subspaces of $R^E$ and $C^E$ respectively consisting of $\mu$-measurable 
functions $x(t)$ with $\lambda(x) = \lambda(|x|) < \infty$ (cf. 5, p. 577). (If $x(t) \in C^E$, it is 
$\mu$-measurable for $\mu \ge 0$ if its Riesz components are $\mu$-measurable (6, p. 168).) 

We show that if $A = \mathfrak{L}_C^{1}(E, \mu)$ (4, § 2) with $E$ and $\mu$ defined as in (2, 
Exercise 4, p. 116) $A^*$ is not complete. We define $g_i(P) = 1/\ln n$, $P = (1/n, 
k/n^2)$, $n = 2, 3, \dots, i$; $g_i(P) = 0$ elsewhere; $g(P) = 1/\ln n$, $P = (1/n, k/n^2)$, 
$n = 2, 3, \dots$; $g(P) = 0$ elsewhere. The $g_i$ form a Cauchy sequence in $A'$ 
and converge to $g$. Each $g_i.\mu$ is in $\mathfrak{A}^*$ but $g.\mu$ is not. 

The $\lambda$-conjugate of every $\lambda$-space is complete since it is also a $\lambda$-space (4). 
Thus the MT-conjugate of an arbitrary $\lambda$-space containing $\mathfrak{K}_C$ is complete 
when it coincides with the $\lambda$-conjugate. 


3. $\mathfrak{N}^A$-extensible MT*-spaces. For a normed or semi-normed space $X$ 
we let $X_u$ denote the subunit elements of $X$, that is, the elements with norm 
or semi-norm not exceeding unity (cf. 6, p. 171). 

Definition 3.1. A semi-norm $\mathfrak{N}^A$ on an MT*-space $A$ will be called reflexive 
if for every $x \in A$, 


$$(3.1)\qquad \mathfrak{N}^A(x) = \sup_{\eta \in \mathfrak{A}_u^*} \Bigl|\int x \, d\eta\Bigr|.$$
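
For orientation, the classical duality between $L^p$ and $L^q$ has exactly the shape of (3.1). The sketch below is an added illustration, not part of the original argument; it assumes $1 < p < \infty$, $1/p + 1/q = 1$, and that the subunit measures are those of the form $g.\mu$ with $N_q(g) \le 1$.

```latex
% Illustration only: assumes 1 < p < \infty and 1/p + 1/q = 1.
% N_p denotes the usual L^p norm for the positive measure \mu.
\[
  N_p(x) = \Bigl(\int |x|^p \, d\mu\Bigr)^{1/p}
         = \sup\Bigl\{\, \Bigl|\int x\,g \, d\mu\Bigr| \;:\; N_q(g) \le 1 \Bigr\},
\]
% which is (3.1) with \eta = g.\mu ranging over measures of base \mu.
```

In this classical case the supremum is taken over the whole unit ball of the dual; the definition above asks for the same identity using only the functionals of integral type.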


THEOREM 3.1. In order that $\mathfrak{N}^A$ be a reflexive semi-norm on the MT*-space 
$A$ it is necessary and sufficient that $A_u^*$ be dense in $A_u'$ for the $\sigma(A', A)$ topology. 


Proof. Since $A_u'$ and $A_u^*$ are équilibré parts of $A'$, the polars of $A_u'$ and $A_u^*$ 
are respectively $A_u'^{\circ} = (x \in A : |y(x)| \le 1$ for all $y \in A_u')$ and $A_u^{*\circ} = (x \in A : 
|y(x)| \le 1$ for all $y \in A_u^*)$ (1, p. 52). We first show that $A_u^{*\circ} = A_u'^{\circ}$. Since 
$A_u^* \subset A_u'$, $A_u'^{\circ} \subset A_u^{*\circ}$ and it is sufficient to prove the opposite inclusion. 
If $x \in A_u^{*\circ}$, the hypothesis that $\mathfrak{N}^A$ is reflexive implies that 


$$\mathfrak{N}^A(x) = \sup_{\eta \in \mathfrak{A}_u^*} \Bigl|\int x \, d\eta\Bigr| \le 1.$$


Thus $|y(x)| \le \mathfrak{N}^A(x)\,|y|_{A'} \le 1$ if $y \in A_u'$ so that $x \in A_u'^{\circ}$. 
Thus $A_u^{*\circ} = A_u'^{\circ}$ and it follows that $A_u^{*\circ\circ} = A_u'^{\circ\circ} = A_u'$. Since $A_u^*$ is 
convex and contains 0, the argument of (1, Proposition 3, p. 52) shows that 
$A_u' = A_u^{*\circ\circ}$ is the closure of $A_u^*$ for $\sigma(A', A)$. 


We next prove that the condition is sufficient. Since the definition of $|\eta|_{\mathfrak{A}^*}$ 
implies that $\ge$ holds in (3.1) we need only show that, given $\varepsilon > 0$, there 
exists $\eta \in \mathfrak{A}_u^*$ with $\mathfrak{N}^A(x) \le |\int x \, d\eta| + \varepsilon$. 

By an extension of the Hahn-Banach Theorem there exists $y_0 \in A_u'$ with 
$y_0(x) = \mathfrak{N}^A(x)$, $|y_0|_{A'} = 1$. The set $[y \in A' ; |(y - y_0)(x)| < \varepsilon]$ is a 
neighbourhood of $y_0$ for the $\sigma(A', A)$ topology and by hypothesis contains $y_1 \in A_u^*$. 
Then 


$$0 \le \mathfrak{N}^A(x) - \Bigl|\int x \, d\eta_1\Bigr| \le |y_0(x) - y_1(x)| = |(y_0 - y_1)(x)| < \varepsilon.$$



















We note the analogy with the relation between $E$ and $E''$ for Banach 
spaces (1, Proposition 5, p. 114). 


Definition 3.2. A semi-norm $\mathfrak{N}^A$ on an MT*-space will be called extensible if $A$ 
satisfies Condition 2.1 and $\mathfrak{N}^A$ is reflexive. An MT*-space will be called 
$\mathfrak{N}^A$-extensible if it has an extensible semi-norm. 

For an extensible semi-norm 


$$(3.2)\qquad \mathfrak{N}^A(x) = \sup_{\eta \in \mathfrak{A}_u^*} \int^{*} |x| \, d|\eta|$$
holds with outer integrals replaced by integrals for every $x \in A$. Formula 
(3.2) then extends the definition of $\mathfrak{N}^A$ to all of $C^E$ and all of $R^E$. 

Given a collection of C-measures $\mathfrak{M}$ a function $x \in C^E$ or $R^E$ will be called 
$\mathfrak{M}$-negligible if $|x(t)|$ is $|\eta|$-negligible for every $\eta \in \mathfrak{M}$. $\mathfrak{M}$-negligible sets, 
$\mathfrak{M}$-equivalence and almost everywhere ($\mathfrak{M}$) are then defined by analogy 
with the case where $\mathfrak{M}$ reduces to a single C-measure $\eta$. When $A$ is an 
$\mathfrak{N}^A$-extensible MT*-space, $\mathfrak{N}^A(x) = 0$ if $x$ is $\mathfrak{A}^*$-negligible. If then $x(t)$ is defined 
and valued in $C$ or $R$ almost everywhere ($\mathfrak{A}^*$), $x$ is $\mathfrak{A}^*$-equivalent to some 
$\bar{x}$ in $C^E$ or $R^E$ and we define $\mathfrak{N}^A(x) = \mathfrak{N}^A(\bar{x})$. When $\mathfrak{A}^* = 0$ every function 
is $\mathfrak{A}^*$-negligible but $\mathfrak{N}^A(x) > 0$ holds for some $x \in A$. 

THEOREM 3.2. For $1 \le p < \infty$, $A = \mathfrak{L}_C^{p}(E, \mu)$ is an $\mathfrak{N}^A$-extensible 
MT*-space. 


LEMMA 3.1. If $A = \mathfrak{L}_C^{\lambda}(E, \mu)$ is an MT*-space for which the $\lambda$-conjugate 
contains the MT-conjugate, then Condition 2.1 is satisfied and every element 
of $\mathfrak{A}^*$ is of base $\mu$. 

Proof of Lemma 3.1. Every $g$ in the $\lambda$-conjugate is locally $\mu$-integrable and 
therefore determines a measure $g.\mu$ (that is, a measure of base $\mu$) (4, § 3). 
If $g \in A^*$, $\eta = g.\mu$ and thus the elements of $\mathfrak{A}^*$ are of base $\mu$. 


The definition of the $\lambda$-conjugate then implies that $|g(t)| \in A^*$. By (7, § 3) 
$|g|.\mu = |g.\mu|$. Now $\int |x||g| \, d\mu \le \lambda(x)\lambda^*(g) < \infty$ and the $g.\mu$-integrability of 
$x$ implies that $\int |x| \, d|g.\mu| < \infty$ (6, Theorem 9.4). Thus by (7, Theorem 1.1), 
for every $x \in A$, 


$$|g|(x) = \int x|g| \, d\mu = \int x \, d(|g|.\mu)_e$$
so that $(|g|.\mu)_e \in \mathfrak{A}^*$. It then follows from the definitions that 
$$\bigl|(|g|.\mu)_e\bigr|_{\mathfrak{A}^*} = \lambda^*(|g|) = \lambda^*(g) = |g.\mu|_{\mathfrak{A}^*}.$$
Proof of Theorem 3.2. It remains to be shown that $N_p$ is reflexive as a 
semi-norm on $A = \mathfrak{L}_C^{p}$. Since $N_p$ is reflexive as a length function, 


$$N_p(x) = \sup_{g \in (\mathfrak{L}_C^{q})_u} \Bigl|\int xg \, d\mu\Bigr| \ge \sup_{g \in A_u^*} \Bigl|\int xg \, d\mu\Bigr|,$$


and it is sufficient to determine $g \in A_u^*$ with $|\int xg \, d\mu|$ arbitrarily near to $N_p(x)$. 













If M(x) < @ there exists Ey) = U,°K,, where {X,} is an increasing 
sequence of compact sets for which, writing f, for the product of the function 
f(t) and the characteristic function of the set B, 

N’(x) = N’(xm) = N(x ae) 
(4, § 2). Now 
Xe, € Le” 
and &,” is R4-extensible as an M7-space. Thus 
N'(x) = R(xx,) = sup | fxs, gdyl. 
oe(XC)y 
Since E» is u-measurable and 
lgmo(t)| < |g(t)|, geo € 2% 
if g € L,*. Thus 
N’(x) = N'(xz,) = sup | f x ge, dul. 
Onye(XC)s 
For g € (fc*), fixed, 
|S xgecdul|—| fx gx du| 
asi—o and 
gx: € (&0)u 
Thus for i sufficiently large and a suitable 
gE (Lo)w ges € (2E)u 


with |fx gx,du| arbitrarily near R(x). The C-measure gx; - has compact 
support so that CK, is gx, .u-negligible (2, Proposition 5, p. 119). Thus 
if f € A and fgg, vanishes in E, 


S*lfldlexs - ul < J*lflexs d(\gxs| mu) + J*lfleid(lgns| -u) = S\fexildu = 0 
and the complex analogue of (4, Theorem 3.1) implies that 


gx,-u € U,*. 


THEOREM 3.3. If $\lambda$ is a reflexive length function for the positive measure $\mu$, 
if $E$ is countable at infinity or if $E$ is arbitrary and $A = \mathfrak{L}_C^{\lambda}$ is an MT*-space 
for which the MT- and $\lambda$-conjugates coincide, then $A$ is $\mathfrak{N}^A$-extensible and every 
measure in $\mathfrak{A}^*$ is of base $\mu$. 


Proof. Theorem 3.3 is a consequence of Lemma 3.1 and the fact that the 
reflexivity of $\mathfrak{N}^A = \lambda$ as a length function implies that it is reflexive as a 
semi-norm on $A$. 


When $A$ is an $\mathfrak{N}^A$-extensible MT*-space we denote by $\mathfrak{F}^A$ the vector 
subspace of $C^E$ of mappings $x$ with $\mathfrak{N}^A(x) < \infty$. Then $\mathfrak{N}^A$ is a non-trivial, 













monotone semi-norm on $\mathfrak{F}^A$ and $\mathfrak{F}^A$ is an MT*-space for which Condition 2.1 
holds. If for each $\eta \ne 0$ in $\mathfrak{A}^*$ there exists a relatively compact set $e(\eta)$ that 
is not $\eta$-measurable the MT-conjugate of $\mathfrak{F}^A$ reduces to the zero element 
of $A'$. Such non-measurable sets exist, for example, if $A = \mathfrak{L}_C^{p}(E, \mu)$ with 
$E = (0, 1)$ and $\mu$ Lebesgue measure on $E$, $1 \le p < \infty$. In contrast, if $E$ is 
arbitrary, if $A = \mathfrak{K}_C$ and $\mathfrak{N}^A$ is the uniform semi-norm, $\mathfrak{N}^A$ extends to $C^E$ 
in the form (6, Theorem 15.3), 


$$\mathfrak{N}^A(x) = \sup_{t \in E} |x(t)|$$


and $\mathfrak{F}^A$ is the space of all bounded functions on $E$ which is an $\mathfrak{N}^A$-extensible 
MT*-space. 

We note that if $B = \mathfrak{F}^A$, where $A$ is an arbitrary $\mathfrak{N}^A$-extensible 
MT*-space, $\mathfrak{N}^B(x) > 0$ is possible for a $\mathfrak{B}^*$-negligible function in $B$ but $\mathfrak{N}^A(x) = 0$ 
for every $\mathfrak{A}^*$-negligible function in $B$. 

The properties of the extended semi-norm $\mathfrak{N}^A$ and of $\mathfrak{F}^A$ for MT-spaces 
(6, § 12) extend to $\mathfrak{N}^A$-extensible MT*-spaces with $\mathfrak{A}'$-negligibility replaced 
by $\mathfrak{A}^*$-negligibility. In particular $\mathfrak{F}^A$ is complete. 

Generalizing (6) we define 

$$\Omega^A = \bigcap_{\eta \in \mathfrak{A}^*} \mathfrak{L}_C^{1}(E, \eta)$$


for every MT*-space $A$. We define $\Omega_0^A = \Omega^A \cap \mathfrak{F}^A$. Then $\Omega_0^A$ is an 
MT*-space with $\mathfrak{N}^A$ (extended) as a semi-norm. 


THEOREM 3.4. If $A$ is an $\mathfrak{N}^A$-extensible MT*-space and if $A^*$ is complete 
or, more generally, tonnelé (1, § 1), then $\Omega_0^A = \Omega^A$. 


Proof. The argument of (8, Theorem 5.1) applies. We note in particular 
that $\Omega_0^A = \Omega^A$ for every $\mathfrak{N}^A$-extensible MT*-space $A$ if $E$ is countable at 
infinity (Theorem 2.1, Corollary). 


4. $\lambda$-spaces generated by $\mathfrak{N}^A$-extensible MT*-spaces. 


THEOREM 4.1. Let $A$ be an $\mathfrak{N}^A$-extensible MT*-space, $\mu$ a positive measure 
on $E$. Then $\mathfrak{N}^A$, extended by (3.2), defines a length function for $\mu$ if and only 
if every $\mu$-negligible set is $\mathfrak{A}^*$-negligible. 


Proof. By (3.1) and the subsequent remarks $\mathfrak{N}^A(x)$ is defined for every 
$x(t)$ that is defined almost everywhere ($\mathfrak{A}^*$) and valued in $R$ and therefore 
for every $x(t)$, $\mu$-measurable and defined, non-negative and valued in $R$ almost 
everywhere ($\mathfrak{A}^*$). That $\mathfrak{N}^A$ then satisfies Conditions (L2)-(L5) for length 
functions (5) is then easily verified. We verify (L5). If $x_n(t) \in R^E$ is 
non-negative and $\mu$-measurable, $n = 1, 2, \dots$, and if $x_n(t)$ increases to $x(t)$ as 
$n \to \infty$, then for each $\eta \in \mathfrak{A}^*$, 


$$\int^{*} x(t) \, d|\eta| = \sup_n \int^{*} x_n(t) \, d|\eta|,$$
by (2, Theorem 3, p. 110). Thus 













$$\mathfrak{N}^A(x) = \sup_{\eta \in \mathfrak{A}_u^*} \int^{*} x(t) \, d|\eta| = \sup_{\eta \in \mathfrak{A}_u^*} \sup_n \int^{*} x_n(t) \, d|\eta| = \sup_n \mathfrak{N}^A(x_n).$$


If (L1) (5) holds every $\mu$-negligible set is $\mathfrak{A}^*$-negligible. Conversely if every 
$\mu$-negligible set is $\mathfrak{A}^*$-negligible, $\mathfrak{N}^A$ is defined and non-negative for every 
$x(t)$ that is non-negative a.e. ($\mu$) (and therefore a.e. ($\mathfrak{A}^*$)) and if $x(t)$ is 
$\mu$-negligible and $e = [t : x(t) \ne 0]$, $e$ is $\mu$-negligible (2, Theorem 1, p. 119) and 
therefore $\mathfrak{A}^*$-negligible. This implies that $x(t)$ is $\eta$-negligible for every $\eta \in \mathfrak{A}^*$ 
and (3.2) then shows that $\mathfrak{N}^A(x) = 0$ giving (L1). 

We note that there exist $\mathfrak{N}^A$-extensible MT*-spaces, in fact MT-spaces on 
a compact set $E$, for which $\mathfrak{N}^A$ cannot define a length function for any measure 
$\mu$. Consider the MT-space $A = \mathfrak{C}_C(E)$ of complex valued functions 
continuous in $E = [0, 1]$ with semi-norm $\mathfrak{N}^A(x) = \sup_{t \in E} |x(t)|$ and suppose that 
$\mathfrak{N}^A$ defines a length function for some positive measure $\mu$. Then, since $\mathfrak{A}^*$ 
contains all the point measures, the empty set is the only $\mathfrak{A}^*$-negligible set 
and therefore, by the preceding theorem, the only $\mu$-negligible set. For each 
$t$, $0 \le t \le 1$, the set $\{t\}$ consisting of the point $t$ is closed and therefore 
$\mu$-measurable and $\mu(\{t\}) > 0$. For some $a > 0$ there is a collection of points $t_i$ 
of $E$ with $\mu(\{t_i\}) > a$, $i = 1, 2, \dots$. Thus for the characteristic function 
of $E$, $\chi_E$, 


$$\mu(\chi_E) = \mu(E) \ge \lim_n \mu\Bigl(\bigcup_1^n t_i\Bigr) \ge \lim_n na = \infty,$$


contradicting the assumption that $\mu$ is a measure since $\chi_E \in \mathfrak{C}_C$. 
The following theorem is a partial converse of Theorem 3.3. 


THEOREM 4.2. Let $A$ be an $\mathfrak{N}^A$-extensible MT*-space, $\mu$ a positive measure 
on $E$ and suppose that all of the elements of $\mathfrak{A}^*$ are of base $\mu$. Suppose that every 
$\mu$-negligible set is $\mathfrak{A}^*$-negligible and that every $\mathfrak{A}^*$-negligible set is locally 
$\mu$-negligible. Then $A \subset \bar{A} \subset \mathfrak{L}_C^{\lambda} = \Omega_0^A \subset \mathfrak{F}^A$. 


Proof. By Theorem 4.1 $\mathfrak{N}^A$ determines a length function $\lambda$ for $\mu$. We denote 
by $\mathfrak{L}_C^{\lambda}$ the $\lambda$-space determined by $\lambda$. By hypothesis every $\eta \in \mathfrak{A}^*$ can be 
written $\eta = g.\mu$ where $g(t)$ is locally $\mu$-integrable. We identify the functions 
$g(t)$ with $A^*$, the measures $g.\mu$ with $\mathfrak{A}^*$. If $E(g) = (t : g(t) \ne 0)$, $E(g)$ is 
$\mu$-measurable and, for every $x \in \Omega^A$, $x_{E(g)}(t)$ is $\mu$-measurable (3, Proposition 3, 
p. 43). Given a compact set $K$ in $E$ with $\mu(K) > 0$ consider, for all $g \in A^*$, 
the collection of subsets $E(g)$ of $K$ with $\mu[E(g)] > 0$. From this collection 
form a maximal collection of disjoint sets and let $B$ denote their union. Since 
this collection will be at most countable $B$ will be $\mu$-measurable. If $g \in A^*$, 
$g_{K-B} \in A^*$ and $\mu[E(g_{K-B})] = 0$ for otherwise $B \cup E(g_{K-B})$ properly contains 
$B$ contradicting the definition of $B$. Thus, for every $g \in A^*$, $g(t) = 0$ almost 
everywhere in $K - B$, $g.\mu(K - B) = 0$ and $K - B$ is $\mathfrak{A}^*$-negligible and 
therefore, by hypothesis, $K - B$ is $\mu$-negligible. If $x \in \Omega^A$, $x_B$ is $\mu$-measurable 
and therefore $x_K$ is $\mu$-measurable. It follows from (2, Proposition 4, p. 182) 







that every $x \in \Omega^A$ is $\mu$-measurable. If $x \in \Omega_0^A$, $\mathfrak{N}^A(x) < \infty$ and $x \in \mathfrak{L}_C^{\lambda}$. Thus 
$A \subset \Omega_0^A \subset \mathfrak{L}_C^{\lambda}$. Since $\mathfrak{L}_C^{\lambda}$ is complete it is closed in $\mathfrak{F}^A$ and contains $\bar{A}$, the 
closure of $A$. 

To prove that $\mathfrak{L}_C^{\lambda} \subset \Omega_0^A$ we must show that every $\mu$-measurable function 
$x(t)$ with $\mathfrak{N}^A(x) < \infty$ is in $\mathfrak{L}_C^{1}(g.\mu)$ for every $g \in A^*$. Every $x(t) \in \mathfrak{L}_C^{\lambda}$ is 
$\mu$-measurable by definition so that the Riesz components of $x(t)$ are 
$\mu$-measurable (6, p. 168). The Riesz components are then measurable ($|g.\mu| = |g|.\mu$) 
for every $g \in A^*$ (3, Proposition 3, p. 43). Thus $x(t)$ is measurable ($g.\mu$) for 
every $g \in A^*$. Since for each $g \in A^*$, $|g.\mu|_e \in \mathfrak{A}^*$, it follows from (3.2) and 
(6, Theorem 9.4) that $x(t) \in \mathfrak{L}_C^{1}(g.\mu)$. 

We note that if to each compact set $K$ corresponds $g(t) \in A^*$ with $g(t) \ne 0$ 
a.e. ($\mu$) in $K$, every $\mathfrak{A}^*$-negligible set is locally $\mu$-negligible. This is true in 
particular if $A^*$ contains $\mathfrak{K}_C$ or the characteristic function of every compact 
set. 


THEOREM 4.3. Suppose that $E$ is countable at infinity or that $E = E_0 \cup \bigcup_1^{\infty} K_i$, 
with each $K_i$ compact and $E_0$ locally $\mu$-negligible, $\mu$ a positive measure. Let $A$ 
be an $\mathfrak{N}^A$-extensible MT*-space for which all of the elements of $\mathfrak{A}^*$ are of base 
$\mu$. Then, if $E_0$ is $\mathfrak{A}^*$-negligible, the normed spaces $\mathbf{L}^{\lambda}$ and $\mathbf{\Omega}_0^A$ associated with 
$\mathfrak{L}_C^{\lambda}$ and $\Omega_0^A$ are equivalent and contain $\mathbf{A}$, the normed space associated with $A$. 


Proof. As in Theorem 4.2 each K, is the union of a u-measurable set B,; and 
an %*-negligible set. If B = U,"B,, xg is u-measurable for every x € 04. 
Every g € A* vanishes a.e. (Y*) in U,°K, — B. If not, for some g, i, 


ulE(gx,-2)] > 0, 


contradicting the definition of B,. It follows that B’ = E — B is U*-negligible. 
Thus for each x(t) € 24, x,(t) is u-measurable and Jt4(x — x,) = 0. If then 
x(t) € Qo4, xg(t) € Lo, vA = N4, with N4(x) = N4(xz) and Q®* C L-’. The 
proof that L,* C Q4 is similar to the corresponding part of the proof of 
Theorem 4.2. 

When E is countable at infinity 2,4 = 24. The space E defined in (2, 
Exercise 4, p. 116) is of the form D °K, with D locally u-negligible for the 
measure » defined there. For the spaces £,?, 1 < p < ~, Dis %*-negligible. 

We note that if ®- is dense in 2°, A = 2° in Theorem 4.2 and A = L,” 
in Theorem 4.3. 


5. MT*-spaces of Cauchy type. If A is an MT*-space, let B be the 
vector subspace of A over R of real mappings in A, B the associated real 
normed vector space. As in (8), with a natural definition of a partial order 
on B, B becomes a “Riesz space.” 


Definition 5.1. A complete $N^A$-extensible MT*-space will be called an
MT*-space of Cauchy type if each subset H of B, bounded in norm and
filtering for the relation ≤, defines a Cauchy filter.










For a maximal MT-space the definition reduces to that given in (8, § 1). 
The theory of MT-spaces of Cauchy type given in (8) extends to MT*-spaces
of Cauchy type with A’-negligibility replaced by U*-negligibility and with 
24 replaced by Q4. 


THEOREM 5.1. If A is an MT*-space of Cauchy type then A = Qo%. If the 
hypotheses of Theorem 4.2 are then satisfied, A = %¢ = Qo* and, if the hypo- 
theses of Theorem 4.3 are satisfied A = L,* = Qo. 


We note that if A = &,* is an MT*-space of Cauchy type, the analogue 
of (8, Corollary 6.1) implies that \ satisfies (L9) (4, ((L9) as modified on 
p. 592)). Thus if E = [0, 1], μ Lebesgue measure, the space $L^{\infty}(E, \mu)$ is not
of Cauchy type.


REFERENCES 


1. N. Bourbaki, Eléments de Mathématique, Fasc. XVIII, "Espaces Vectoriels Topologiques,"
chaps. III-V (Paris, 1955).

2. N. Bourbaki, Eléments de Mathématique, Fasc. XIII, "Integration," chaps. I-IV (Paris, 1952).

3. N. Bourbaki, Eléments de Mathématique, Fasc. XXI, "Integration," chap. V (Paris, 1956).

4. H. W. Ellis, On the MT*- and λ-conjugates of $L^{\lambda}$ spaces, Can. J. Math., 10 (1958), 381-91.

5. H. W. Ellis and I. Halperin, Function spaces determined by a levelling length function,
Can. J. Math., 5 (1953), 576-92.

6. Marston Morse and William Transue, Semi-normed vector spaces with duals of integral
type, Jour. d'Analyse Math., 4 (1955), 149-86.

7. Marston Morse and William Transue, Products of a C-measure and a locally integrable
mapping, Can. J. Math., 9 (1957), 475-86.

8. Marston Morse and William Transue, Vector subspaces A of $C^E$ with duals of integral
type, J. Math. pures et appl., Series 9, 37 (1958), 343-363.








Queen's University, 
Summer Research Institute of the Canadian Mathematical Congress 














ON A TYPE PROBLEM 
JAMES A. JENKINS 


Considerable interest has attached to the problem of determining the type 
of a Riemann surface obtained by performing an identification between the 
edges of a strip or a half-strip (1, 2, 4, 5, 8). A fairly thorough analysis was 
made in 1946 by Volkovyskii (6) who gave various sufficient conditions for 
parabolic and hyperbolic type. The object of the present paper is to show that 
his principal sufficient condition for hyperbolic type can be substantially 
improved. 

We regard the half-strip S in the z-plane, $z = x + iy$,

\[ x > a, \qquad 0 < y < b, \]

where a and b are finite real numbers, $b > 0$. (The case of a full strip is entirely
equivalent.) We denote its edges $x > a$, $y = 0$ and $x > a$, $y = b$ respectively
by $L_1$ and $L_2$. We consider the identification on $L_1$ and $L_2$ determined by the
mapping (defined for $x > a$)

\[ T(x) = f(x) + ib, \]

where f(x) is an increasing function of x with $f(a) > a$. We will suppose that
this identification determines a Riemann surface ℛ which is then doubly-connected
with one boundary component C determined by the segments

\[ x = a, \quad 0 \le y \le b; \qquad a \le x \le f(a), \quad y = b. \]

For this to hold it is necessary and sufficient that for each $x_0 > a$ there exist
a disc $|w| < 1$ divided by a simple open arc λ into two domains $D_1$ and $D_2$
and neighbourhoods $E_1$ and $E_2$ of $x_0$ and $T(x_0)$ relative to S such that there
exist conformal mappings $w = \psi_i(z)$, $i = 1, 2$, of $E_i$ onto $D_i$ each admitting a
homeomorphic extension to an open boundary arc $\gamma_i$ of $E_i$ on $L_i$ which it
carries onto λ and such that

\[ \psi_1(x) = \psi_2(T(x)), \qquad x \in \gamma_1. \]


Non-trivial necessary and sufficient conditions on f(x) for the identification T 
to determine a Riemann surface are not known. Some sufficient conditions 
were given by Volkovyskii (7). An easily verified sufficient condition is that 
f(x) should possess a continuous derivative which does not take the value 


Received August 20, 1958. Research supported in part by the National Science Foundation, 
Mathematics Section, through the University of Notre Dame and by the Office of Ordnance 
Research under contract No. DA-36-034-ORD-2453 through the Institute for Advanced 
Study. 




zero. For our purposes here we assume that f(x) and its inverse are absolutely
continuous, that the identification T does determine a Riemann surface and
that the extension of $\psi_i(z)$, $i = 1, 2$, to $\gamma_i$ is continuously differentiable for
each $x_0$ together with its inverse mapping. Without some restriction it is not
a priori certain that such a surface is uniquely determined by the identification
T. Volkovyskii (7) gave some sufficient conditions for this to hold but
they are probably far from necessary.

A Riemann surface ℛ determined by the above identification can be mapped
conformally on the circular ring

\[ 1 < |\zeta| < R \]

where C corresponds to the boundary component $|\zeta| = 1$ and $R \le \infty$. We will
distinguish the cases according as $R < \infty$ or $R = \infty$ as being respectively
hyperbolic or parabolic.

In order that we have the hyperbolic case it is sufficient that ℛ have finite
module for the family of curves Γ separating its boundary components (3, p.
13). That means that if $\rho(w)\,|dw|$ ranges over all conformally invariant metrics
on ℛ (w denotes a local uniformizing parameter) such that for any locally
rectifiable simple closed curve γ separating the boundary components of ℛ

\[ \int_{\gamma} \rho\,|dw| \ge 1, \]

then ($w = u + iv$)

\[ \min \iint_{ℛ} \rho^2\,du\,dv \]

is finite. In particular it is enough to manifest one admissible metric for which
the above integral is finite. First, however, we wish to study the images in
S of the family of curves Γ on ℛ.

To curves on ℛ in Γ correspond sets on S which may display quite considerable
complication. We will denote the family of such sets by Γ*. Points
of such a set γ* on $L_1$ are accompanied by their images on $L_2$ under the mapping
T. For any set ω in S we will call the (orthogonal) projection of $\omega - L_2$
on $L_1$ the projection $\pi(\omega)$ of ω. We denote the function f(x) iterated n times
by $f_n(x)$; also we take $f_0(x) = x$. Points $(x', 0)$, $(x'', 0)$ on $L_1$ such that

\[ x'' = f_n(x') \]

for some n will be called congruent points. The essential property of sets in
Γ* is given in the following lemma.


LEMMA. For every $\gamma^* \in \Gamma^*$ there exists a value c, $c > a$, such that $\pi(\gamma^*)$
contains a point congruent to each point in the interval $[c, f(c))$.


The set γ* consists of arcs running from $L_1$ to $L_2$, intervals on $L_1$ and $L_2$
and arcs running from $L_1$ back to $L_1$ or from $L_2$ back to $L_2$. We note first that
since the corresponding curve $\gamma \in \Gamma$ is compact there can be only a finite
number of arcs running from $L_1$ to $L_2$. Further we can replace these arcs by
rectilinear segments with the same end-points not increasing their projections.
If we replace an arc running from $L_1$ ($L_2$) back to $L_1$ ($L_2$) by the segment joining
its end-points in the projection we at most replace a segment by a congruent
segment. Finally a number of segments on $L_1$ ($L_2$) described consecutively
(possibly overlapping) joining two points can be replaced by a single segment
joining these points without increasing the projection. Thus it is enough to
prove the result of the lemma when γ* consists of a finite number of segments
joining $L_1$ and $L_2$ and lying on $L_1$ and $L_2$.

There must be at least one segment joining $L_1$ and $L_2$. Let then $P_1$ be the
end-point of such a segment farthest to the right on $L_1$ and $P_2$ be the end-point
of such a segment farthest to the right on $L_2$. If $P_2 = T(P_1)$, γ* consists
of a single segment and the result of the lemma is evident. If $P_2$ is to the
right of $T(P_1)$ it must be either the end-point of both a segment joining $L_1$
and $L_2$ and a segment on $L_2$ or the end-point of two segments joining $L_1$ and
$L_2$. In the first instance replacing the segments by a segment forming with
them a triangle (and deleting the corresponding segment on $L_1$) we obtain a
new $\gamma' \in \Gamma^*$ with one less side (counting only one side for a pair of corresponding
segments on $L_1$ and $L_2$) and a not larger projection. In the second
instance replacing the segments by the segment on $L_1$ forming with them a
triangle (and inserting the corresponding segment on $L_2$) we obtain a new
$\gamma'' \in \Gamma^*$ with one less side and a not larger projection. Similarly if $P_2$ is to the
left of $T(P_1)$, $P_1$ must be either the end-point of both a segment joining $L_1$
and $L_2$ and a segment on $L_1$, or the end-point of two segments joining $L_1$ and
$L_2$. Proceeding as before the same conclusions apply. Since the result of the
lemma is true for γ* consisting of a single segment it follows in full generality
by induction.

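We now make our final assumption, that

\[ \lim_{n \to \infty} f_n(a) = \infty. \tag{1} \]

We denote by $I_n$, $n = 0, 1, \ldots$, the region

\[ f_n(a) \le x < f_{n+1}(a), \qquad 0 \le y \le b. \]

Let $\varphi_n(x)$ denote the function inverse to $f_n(x)$. Let u(x) be a non-negative
integrable function defined for $a \le x < f(a)$ with

\[ 0 < \int_a^{f(a)} u(x)\,dx. \]

Then we consider in S the metric $\rho(z)\,|dz|$ where

\[ \rho(z) = u(\varphi_n(x))\,\varphi_n'(x), \qquad z \in I_n. \]

Let $\gamma^* \in \Gamma^*$. Under assumption (1) it follows from our lemma that $\pi(\gamma^*)$
contains a point congruent to every point of the interval $[a, f(a))$. Thus

\[ \int_{\gamma^*} \rho\,|dz| \ge \int_a^{f(a)} u(x)\,dx. \]

On the other hand

\[
\begin{aligned}
\iint_S \rho^2\,dx\,dy
 &= \sum_{n=0}^{\infty} \iint_{I_n} \bigl[u(\varphi_n(x))\,\varphi_n'(x)\bigr]^2\,dx\,dy \\
 &= b \sum_{n=0}^{\infty} \int_{f_n(a)}^{f_{n+1}(a)} \bigl[u(\varphi_n(x))\,\varphi_n'(x)\bigr]^2\,dx \\
 &= b \sum_{n=0}^{\infty} \int_a^{f(a)} \frac{(u(x))^2}{f_n'(x)}\,dx \\
 &= b \int_a^{f(a)} (u(x))^2 \Bigl( \sum_{n=0}^{\infty} \frac{1}{f_n'(x)} \Bigr)\,dx
\end{aligned}
\]

provided that the operations involved are legitimate. This will be the case if

\[ \sum_{n=0}^{\infty} \frac{1}{f_n'(x)} \]

converges at those points where u(x) is positive and the last integral is convergent.
In these circumstances the identification determined by f comes under
the hyperbolic case.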
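
The change of variable behind the computation above can be written out as a short worked step. This is only a sketch: it assumes that $f_n$ is differentiable almost everywhere with $f_n' > 0$, which is what the absolute continuity hypotheses on f and its inverse are meant to provide, and it uses $f_{n+1}(a) = f_n(f(a))$.

```latex
% Substitute x = f_n(t); then \varphi_n(x) = t, \varphi_n'(x) = 1 / f_n'(t),
% and t runs over [a, f(a)) as x runs over [f_n(a), f_{n+1}(a)).
\[
\int_{f_n(a)}^{f_{n+1}(a)} \bigl[u(\varphi_n(x))\,\varphi_n'(x)\bigr]^2\,dx
  = \int_a^{f(a)} \frac{(u(t))^2}{(f_n'(t))^2}\, f_n'(t)\,dt
  = \int_a^{f(a)} \frac{(u(t))^2}{f_n'(t)}\,dt .
\]
```

Summing over n and interchanging sum and integral (legitimate for non-negative integrands) gives the final expression of the display.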

We state our result as follows. 


THEOREM. If the function f(x) defined for $a \le x$ is absolutely continuous
together with its inverse, if the identification it provides on the strip S determines a
Riemann surface ℛ, if the corresponding functions $\psi_i(z)$, $i = 1, 2$, admit
extensions to the open boundary arcs of $E_i$ on $L_i$ which are continuously differentiable
together with their inverse mappings, if f(x) satisfies the condition

\[ \lim_{n \to \infty} f_n(a) = \infty \tag{1} \]

and if the series

\[ \sum_{n=0}^{\infty} \frac{1}{f_n'(x)} \]

converges on a set of positive measure on $[a, f(a))$ then the identification comes
under the hyperbolic case.


Indeed this sum is greater than or equal to one on the set of positive measure 
in question; thus we can take u(x) in the preceding argument as the reciprocal 
of the sum on that set and elsewhere zero. 
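
As an illustration of the series criterion, the following minimal Python sketch evaluates partial sums of $\sum 1/f_n'(x)$ by the chain rule. The two functions used, f(x) = 2x and f(x) = x + 1, and the helper name `series_partial_sum` are hypothetical choices made here for illustration and do not come from the paper.

```python
def series_partial_sum(f, f_prime, x, terms):
    """Partial sum of sum_{n=0}^{terms-1} 1 / f_n'(x), where f_n is the n-th
    iterate of f and f_n'(x) is accumulated by the chain rule."""
    total = 0.0
    point = x          # f_n(x), starting from f_0(x) = x
    derivative = 1.0   # f_n'(x), starting from f_0'(x) = 1
    for _ in range(terms):
        total += 1.0 / derivative
        derivative *= f_prime(point)  # f_{n+1}'(x) = f'(f_n(x)) * f_n'(x)
        point = f(point)
    return total

# Hypothetical example 1: f(x) = 2x with a = 1, so f_n'(x) = 2**n.
doubling = series_partial_sum(lambda x: 2 * x, lambda x: 2.0, 1.5, 60)

# Hypothetical example 2: f(x) = x + 1, so f_n'(x) = 1 for every n.
shifting = series_partial_sum(lambda x: x + 1, lambda x: 1.0, 1.5, 50)

print(doubling, shifting)
```

For f(x) = 2x the partial sums approach 2, so the series converges everywhere on [a, f(a)) and the criterion of the theorem is met. For f(x) = x + 1 the partial sums grow without bound; since the theorem gives only a sufficient condition, the sketch by itself draws no conclusion in that case.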

This result represents a substantial improvement of one of Volkovyskii's 
basic results which requires that the above series have a bounded sum on an 
interval as a sufficient condition for the hyperbolic case in addition to other 
requirements on the function f(x) some of which are not germane to the 
present problem. It is immediately seen that the present considerations extend
similarly to the case of identification of two strips also discussed by
Volkovyskii (6).






























REFERENCES 


1. C. Blanc, Les surfaces de Riemann des fonctions méromorphes, Comm. Math. Helvet., 9
(1936-7), 193-216, 335-68.

2. C. Blanc, Les demi-surfaces de Riemann. Application au problème du type, Comm. Math.
Helvet., 10 (1938-9), 130-50.

3. James A. Jenkins, Univalent functions and conformal mapping (Springer-Verlag, Berlin-
Göttingen-Heidelberg, 1958).

4. R. Nevanlinna, Ueber die Polygondarstellung einer Riemannschen Fläche, Ann. Acad. Sci.
Fenn. Ser. AI, no. 122 (1952).

5. R. Nevanlinna, "Polygonal representation of Riemann surfaces," Lectures on functions of a
complex variable (Michigan, 1955), 65-70.

6. L. I. Volkovyskii, On the problem of type of simply-connected Riemann surfaces, Mat.
Sbornik, 18 (N.S.) (1946), 185-211 (Russian, English summary).

7. L. I. Volkovyskii, Quasiconformal mapping and the problem of conformal pasting, Ukrain.
Mat. Z., 3 (1951), 39-52 (Russian).

8. E. M. Wirth, Ueber die Bestimmung des Typus einer Riemannschen Fläche, Comm. Math.
Helvet., 31 (1956), 90-107.








University of Notre Dame 
Institute for Advanced Study
















ON SOME PROPERTIES OF FUNCTIONS ANALYTIC 
IN A HALF-PLANE 


P. G. ROONEY 


1. Introduction. The spaces $\mathfrak{H}_p(\omega)$, ω real, $1 < p < \infty$, consist of those
functions f(s), analytic for $\operatorname{Re} s > \omega$, and such that $\mu_p(f; x)$ is bounded for
$x > \omega$, where

\[ \mu_p(f; x) = \frac{1}{2\pi} \int_{-\infty}^{\infty} |f(x + iy)|^p\,dy. \tag{1.1} \]

Doetsch (1) has shown that if $e^{-\omega t}\varphi(t) \in L_p(0, \infty)$, $1 < p \le 2$, and f is the
Laplace transform of φ, that is,

\[ f(s) = \int_0^{\infty} e^{-st}\varphi(t)\,dt, \qquad \operatorname{Re} s > \omega, \]

then $f \in \mathfrak{H}_q(\omega)$, where

\[ p^{-1} + q^{-1} = 1, \tag{1.2} \]

and that conversely if $f \in \mathfrak{H}_p(\omega)$, $1 < p \le 2$, then there is a function φ, with
$e^{-\omega t}\varphi(t) \in L_q(0, \infty)$, such that f is the Laplace transform of φ.

The proofs of Doetsch's theorems are based on a generalization of Plancherel's
theorem due to Titchmarsh (5). Titchmarsh's theorem states that if
$F \in L_p(-\infty, \infty)$, $1 < p \le 2$, then F has a Fourier transform $G \in L_q(-\infty, \infty)$.

However, there are other extensions of Plancherel's theorem due to Hardy
and Littlewood (3). They have shown that if $F \in L_p(-\infty, \infty)$, $1 < p \le 2$,
then F has a Fourier transform G such that $|x|^{1-2/p} G(x) \in L_p(-\infty, \infty)$, and
that conversely if $|x|^{1-2/q} F(x) \in L_q(-\infty, \infty)$, $q \ge 2$, then F has a Fourier
transform $G \in L_q(-\infty, \infty)$; for this form of Hardy and Littlewood's
theorems see (7, Theorems 79 and 80). One might expect that a theory
similar to Doetsch's theory could be constructed from these theorems, and
this we shall do here.

To this end we define spaces $\mathcal{H}_p(\omega)$, $1 < p < \infty$, to consist of those functions
f(s) such that $(s - \omega)^{1-2/p} f(s) \in \mathfrak{H}_p(\omega)$ (where $(s - \omega)^{1-2/p}$ takes on its
principal value). This is equivalent to saying that $\nu_p(f; x, \omega)$ should be bounded
for $x > \omega$, where

\[ \nu_p(f; x, \omega) = \frac{1}{2\pi} \int_{-\infty}^{\infty} |x - \omega + iy|^{p-2}\,|f(x + iy)|^p\,dy. \tag{1.3} \]

In § 3 we shall obtain theorems corresponding to Doetsch's results for these
new spaces. It will be noticed that $\mathcal{H}_2(\omega) = \mathfrak{H}_2(\omega)$, so that one would expect


Received July 28, 1958. This paper was written while the author was a fellow at the 1958 
Summer Research Institute of the Canadian Mathematical Congress. 




that our new theorems should reduce, for $p = 2$, to Doetsch's theorems. This
is actually the case.

In an earlier paper (4) we generalized Doetsch's theory. In order to obtain
theorems dealing with the Laplace transformation of functions of the form
$t^{\lambda}\varphi(t)$, where $e^{-\omega t}\varphi(t) \in L_p(0, \infty)$ and $\lambda \ge 0$, we "generalized" the spaces
$\mathfrak{H}_p(\omega)$ to spaces $\mathfrak{H}_{p,\lambda}(\omega)$. We can carry out a similar programme here, and to
this end we define spaces $\mathcal{H}_{p,\lambda}(\omega)$ as follows. $\mathcal{H}_{p,0}(\omega) = \mathcal{H}_p(\omega)$; if $\lambda > 0$,
$\mathcal{H}_{p,\lambda}(\omega)$ consists of those functions f in $\mathcal{H}_p(\omega')$ for every $\omega' > \omega$ such that
$\nu_{p,\lambda}(f; \omega)$ is finite, where

\[ \nu_{p,\lambda}(f; \omega) = \int_{\omega}^{\infty} (x - \omega)^{p\lambda - 1}\,\nu_p(f; x, \omega)\,dx. \tag{1.4} \]

The theorems corresponding to the results of (4) are obtained in § 4.

In § 2 we prove certain preliminary lemmas concerning the properties of
functions in $\mathcal{H}_p(\omega)$.

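2. Preliminary lemmas.

LEMMA 1. If $f \in \mathcal{H}_p(\omega)$, $1 < p < \infty$, then

\[ f(\omega + iy) = \lim_{x \to \omega+} f(x + iy) \]

exists for almost all y, and $|y|^{1-2/p} f(\omega + iy) \in L_p(-\infty, \infty)$. Further,
$(x - \omega + iy)^{1-2/p} f(x + iy)$ converges in mean of order p to $(iy)^{1-2/p} f(\omega + iy)$
as $x \to \omega+$. Also, $\nu_p(f; x, \omega)$ tends steadily from below, as $x \to \omega+$, to

\[ \frac{1}{2\pi} \int_{-\infty}^{\infty} |y|^{p-2}\,|f(\omega + iy)|^p\,dy. \]

Proof. The statement follows on applying (1, Lemma 7) to
$F(z) = (z - \omega)^{1-2/p} f(z)$.


LEMMA 2. Let f(s) be analytic for $\operatorname{Re} s > \omega$, and suppose

\[ \int_{-\infty}^{\infty} |x - \omega + iy|^{p-2}\,|f(x + iy)|^p\,dy \]

is bounded for $x_1 \le x \le x_2$, where $p > 1$, $x_1 > \omega$. Then as $y \to \pm\infty$,
$f(x + iy) = o(|y|^{1-2/q})$, uniformly in x for $x_1 + \delta \le x \le x_2 - \delta$, where
$0 < \delta < \tfrac{1}{2}(x_2 - x_1)$.

Proof. Let $\Phi(\zeta) = (-i\zeta)^{1-2/p} f(\omega - i\zeta)$, where $\zeta = \xi + i\eta$, and $(-i\zeta)^{1-2/p}$
has its principal value. Then if $\eta > 0$,

\[
\int_{-\infty}^{\infty} |\Phi(\xi + i\eta)|^p\,d\xi
  = \int_{-\infty}^{\infty} |\eta - i\xi|^{p-2}\,|f(\omega + \eta - i\xi)|^p\,d\xi
  = \int_{-\infty}^{\infty} |\eta + i\xi|^{p-2}\,|f(\omega + \eta + i\xi)|^p\,d\xi,
\]

which is bounded for $x_1 - \omega \le \eta \le x_2 - \omega$. Hence by (7, Lemma, p. 125),
$\lim_{\xi \to \pm\infty} \Phi(\xi + i\eta) = 0$ uniformly in η for $x_1 - \omega + \delta \le \eta \le x_2 - \omega - \delta$. Thus,
setting $x = \omega + \eta$, $y = -\xi$,

\[ \lim_{y \to \pm\infty} (x - \omega + iy)^{1-2/p} f(x + iy) = 0 \]

uniformly in x for $x_1 + \delta \le x \le x_2 - \delta$. But clearly

\[ (x - \omega + iy)^{2/p-1} = O(|y|^{2/p-1}) \quad \text{as } y \to \pm\infty, \]

uniformly in x for x in the same interval. Hence

\[ \lim_{y \to \pm\infty} |y|^{1-2/p} f(x + iy) = 0 \]

uniformly in x for x in this interval; that is,

\[ f(x + iy) = o(|y|^{-(1-2/p)}) = o(|y|^{1-2/q}) \]

uniformly in x for $x_1 + \delta \le x \le x_2 - \delta$.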
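
The exponent bookkeeping used in the last step of this proof, and again when the Cauchy integrand is split into two factors in Lemma 3, follows directly from relation (1.2). A short worked check, assuming only $p^{-1} + q^{-1} = 1$:

```latex
% With 1/p + 1/q = 1 as in (1.2):
\[
1 - \frac{2}{q} = 1 - 2\Bigl(1 - \frac{1}{p}\Bigr) = \frac{2}{p} - 1
                = -\Bigl(1 - \frac{2}{p}\Bigr),
\qquad
\Bigl(1 - \frac{2}{p}\Bigr) + \Bigl(1 - \frac{2}{q}\Bigr) = 0 .
\]
```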
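LEMMA 3. If $f \in \mathcal{H}_q(\omega)$, $q \ge 2$, and $\omega \le \xi < \operatorname{Re} s$, then

\[ f(s) = \frac{1}{2\pi} \int_{-\infty}^{\infty} \frac{f(\xi + i\eta)}{s - (\xi + i\eta)}\,d\eta. \]

Proof. Suppose first $\omega < \xi < \operatorname{Re} s$. Let $s = x + iy$, and choose R and ρ
so that $\rho > x$, and $R > |y|$. Then

\[ f(s) = \frac{1}{2\pi i} \int \frac{f(\zeta)}{\zeta - s}\,d\zeta, \]

the integral being taken around the rectangle with vertices $\xi \pm iR$ and
$\rho \pm iR$. The integral along the upper side of the rectangle is given by

\[ \frac{1}{2\pi i} \int_{\xi}^{\rho} \frac{f(\alpha + iR)}{s - (\alpha + iR)}\,d\alpha. \]

But by Lemma 2, $f(\alpha + iR) = o(R^{1-2/p})$ as $R \to \infty$, uniformly in α for
$\xi \le \alpha \le \rho$. Hence the integral along the upper side is $o(R^{-2/p})$ and consequently
tends to zero as $R \to \infty$. Similarly, the integral along the lower
side of the rectangle tends to zero as $R \to \infty$. Hence letting $R \to \infty$,

\[
f(s) = \frac{1}{2\pi} \int_{-\infty}^{\infty} \frac{f(\xi + i\eta)}{s - (\xi + i\eta)}\,d\eta
     - \frac{1}{2\pi} \int_{-\infty}^{\infty} \frac{f(\rho + i\eta)}{s - (\rho + i\eta)}\,d\eta.
\]

Now the second of these integrals tends to zero as $\rho \to \infty$. For from Hölder's
inequality it is smaller in modulus than

\[
\bigl(\nu_q(f; \rho, \omega)\bigr)^{1/q}
\left\{ \frac{1}{2\pi} \int_{-\infty}^{\infty}
  \frac{|\rho - \omega + i\eta|^{p-2}}{|s - (\rho + i\eta)|^{p}}\,d\eta \right\}^{1/p}.
\]

The first term of this expression is bounded by hypothesis; since $1 < p \le 2$,
the second term is smaller than

\[
\left\{ \frac{1}{2\pi} \int_{-\infty}^{\infty}
  \frac{(\rho - \omega)^{p-2}\,d\eta}{((\rho - x)^2 + (\eta - y)^2)^{p/2}} \right\}^{1/p}
= \left\{ \frac{(\rho - \omega)^{p-2}}{(\rho - x)^{p-1}} \cdot \frac{1}{2\pi}
  \int_{-\infty}^{\infty} \frac{d\eta}{(1 + \eta^2)^{p/2}} \right\}^{1/p}
= O(\rho^{-1/p})
\]

as $\rho \to \infty$. Hence letting $\rho \to \infty$,

\[
f(s) = \frac{1}{2\pi} \int_{-\infty}^{\infty} \frac{f(\xi + i\eta)}{s - (\xi + i\eta)}\,d\eta,
\qquad \omega < \xi < \operatorname{Re} s.
\]

It remains to show that this equation remains true when $\xi = \omega$. For this
we write the equation in the form

\[
f(s) = \frac{1}{2\pi} \int_{-\infty}^{\infty}
  (\xi - \omega + i\eta)^{1-2/q} f(\xi + i\eta)\,
  \frac{(\xi - \omega + i\eta)^{1-2/p}}{s - (\xi + i\eta)}\,d\eta.
\]

The first term of the integrand of this last integral converges in mean of
order q to $(i\eta)^{1-2/q} f(\omega + i\eta)$ as $\xi \to \omega+$. We shall show that the second term
of the integrand converges in mean of order p to $(i\eta)^{1-2/p}/(s - (\omega + i\eta))$ as
$\xi \to \omega+$. Clearly it tends to this limit pointwise. Further, since $1 < p \le 2$,
we have if $\xi \le \gamma < x$,

\[
\begin{aligned}
\left| \frac{(\xi - \omega + i\eta)^{1-2/p}}{s - (\xi + i\eta)}
     - \frac{(i\eta)^{1-2/p}}{s - (\omega + i\eta)} \right|^p
 &\le 2^p \left\{ \frac{((\xi - \omega)^2 + \eta^2)^{p/2-1}}{((x - \xi)^2 + (\eta - y)^2)^{p/2}}
     + \frac{|\eta|^{p-2}}{((x - \omega)^2 + (\eta - y)^2)^{p/2}} \right\} \\
 &\le 2^{p+1}\,\frac{|\eta|^{p-2}}{((x - \gamma)^2 + (\eta - y)^2)^{p/2}},
\end{aligned}
\]

which is in $L_1(-\infty, \infty)$ as a function of η. Hence by Lebesgue's theorem of
dominated convergence,

\[
\lim_{\xi \to \omega+} \int_{-\infty}^{\infty}
\left| \frac{(\xi - \omega + i\eta)^{1-2/p}}{s - (\xi + i\eta)}
     - \frac{(i\eta)^{1-2/p}}{s - (\omega + i\eta)} \right|^p d\eta = 0.
\]

Thus, letting $\xi \to \omega+$ we obtain from (6, § 12.5, example (iv))

\[
f(s) = \frac{1}{2\pi} \int_{-\infty}^{\infty} (i\eta)^{1-2/q} f(\omega + i\eta)\,
       \frac{(i\eta)^{1-2/p}}{s - (\omega + i\eta)}\,d\eta
     = \frac{1}{2\pi} \int_{-\infty}^{\infty} \frac{f(\omega + i\eta)}{s - (\omega + i\eta)}\,d\eta.
\]


LEMMA 4. If $f \in \mathcal{H}_q(\omega)$, $q \ge 2$, and if $\xi \ge \omega$ and $\operatorname{Re} s < \xi$, then

\[ \frac{1}{2\pi} \int_{-\infty}^{\infty} \frac{f(\xi + i\eta)}{s - (\xi + i\eta)}\,d\eta = 0. \]

Proof. The statement follows much as in the previous lemma.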
















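3. The spaces $\mathcal{H}_p(\omega)$. Theorems 1 and 2 correspond to Theorems 2 and 3
respectively of Doetsch (1).


THEOREM 1. If $e^{-\omega t}\varphi(t) \in L_p(0, \infty)$, $1 < p \le 2$, and

\[ f(s) = \int_0^{\infty} e^{-st}\varphi(t)\,dt, \qquad \operatorname{Re} s > \omega, \]

then $f \in \mathcal{H}_p(\omega)$ and if $x > \omega$,

\[ \nu_p(f; x, \omega) \le K \int_0^{\infty} e^{-pxt}\,|\varphi(t)|^p\,dt, \]

where K depends on p alone.


Proof. If $x > \omega$,

\[ f(x - iy) = \int_0^{\infty} e^{iyt}\,e^{-xt}\varphi(t)\,dt; \]

that is, for each fixed $x > \omega$, $f(x - iy)$ is the Fourier transform of a function
in $L_p(0, \infty)$. Hence by (7, Theorem 80), since $1 < p \le 2$,

\[
\begin{aligned}
\nu_p(f; x, \omega)
 &= \frac{1}{2\pi} \int_{-\infty}^{\infty} |x - \omega + iy|^{p-2}\,|f(x + iy)|^p\,dy \\
 &\le \frac{1}{2\pi} \int_{-\infty}^{\infty} |y|^{p-2}\,|f(x - iy)|^p\,dy \\
 &\le \frac{K(p)}{2\pi} \int_0^{\infty} e^{-pxt}\,|\varphi(t)|^p\,dt
  \le \frac{K(p)}{2\pi} \int_0^{\infty} e^{-p\omega t}\,|\varphi(t)|^p\,dt,
\end{aligned}
\]

so that $f \in \mathcal{H}_p(\omega)$ and the stated inequality holds with $K = K(p)/2\pi$.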
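
A numerical sanity check of the inequality of Theorem 1 in the case p = 2 may help fix the normalisation. The sketch below is illustrative only: the test function φ(t) = exp(-t) (so that f(s) = 1/(s + 1) and ω = 0), the helper name `nu_p`, and the quadrature scheme are all assumptions made here, not part of the paper. For p = 2 Plancherel's theorem gives equality between the weighted mean, taken with the factor 1/2π, and the integral of exp(-2xt)|φ(t)|², which for this φ equals 1/(2(x + 1)).

```python
import math

def nu_p(f, x, omega, p, steps=20000):
    """Approximate (1/2pi) * integral over y of
    |x - omega + i y|^(p-2) * |f(x + i y)|^p dy,
    using the substitution y = tan(theta) and the midpoint rule."""
    h = math.pi / steps
    total = 0.0
    for k in range(steps):
        theta = -math.pi / 2 + (k + 0.5) * h
        y = math.tan(theta)
        weight = abs(complex(x - omega, y)) ** (p - 2)
        # dy = d(theta) / cos(theta)^2
        total += weight * abs(f(complex(x, y))) ** p / math.cos(theta) ** 2
    return total * h / (2 * math.pi)

def f(s):
    # Laplace transform of the hypothetical test function phi(t) = exp(-t)
    return 1.0 / (s + 1.0)

x = 0.5
lhs = nu_p(f, x, 0.0, 2.0)
rhs = 1.0 / (2.0 * (x + 1.0))  # integral of exp(-2xt) * exp(-2t) over t >= 0
print(lhs, rhs)
```

For p strictly between 1 and 2 the same routine can be used to tabulate the left-hand side, but the constant K(p) of (7, Theorem 80) is not computed here.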


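THEOREM 2. If $f \in \mathcal{H}_q(\omega)$, $q \ge 2$, then there is a function φ, with
$e^{-\omega t}\varphi(t) \in L_q(0, \infty)$, such that

\[ f(s) = \int_0^{\infty} e^{-st}\varphi(t)\,dt, \qquad \operatorname{Re} s > \omega. \]

Further, if $x \ge \omega$,

\[ \int_0^{\infty} e^{-qxt}\,|\varphi(t)|^q\,dt \le K\,\nu_q(f; x, \omega), \]

where K depends on q alone.
Also for $x \ge \omega$ and for almost all t,

\[
e^{xt}\,\operatorname{l.i.m.}^{(q)}_{R \to \infty} \frac{1}{2\pi} \int_{-R}^{R} e^{i\eta t} f(x + i\eta)\,d\eta
 = \begin{cases} \varphi(t), & t > 0, \\ 0, & t < 0 \end{cases}
\]

(where $\operatorname{l.i.m.}^{(q)}$ denotes the limit in mean of order q).


Proof. By Lemma 1, $|y|^{1-2/q} f(\omega + iy) \in L_q(-\infty, \infty)$. Hence by (7, Theorem
79), $f(\omega + iy)$ has a Fourier transform $F \in L_q(-\infty, \infty)$, given by the
formula

\[ F(t) = \operatorname{l.i.m.}^{(q)}_{R \to \infty} \frac{1}{(2\pi)^{1/2}} \int_{-R}^{R} e^{i\eta t} f(\omega + i\eta)\,d\eta. \]

Let $\varphi(t) = (2\pi)^{-1/2} e^{\omega t} F(t)$. Clearly $e^{-\omega t}\varphi(t) \in L_q(-\infty, \infty)$.

Now for each s with $\operatorname{Re} s \ne \omega$, $(s - (\omega + i\eta))^{-1} \in L_p(-\infty, \infty)$ as a
function of η. Also a straightforward calculation shows that if $\operatorname{Re} s \ne \omega$,

\[
\frac{1}{(2\pi)^{1/2}} \int_{-\infty}^{\infty} \frac{e^{i\eta t}}{s - (\omega + i\eta)}\,d\eta
 = \begin{cases}
   -(2\pi)^{1/2} e^{(s-\omega)t}, & t > 0,\ \operatorname{Re} s < \omega, \\
   (2\pi)^{1/2} e^{(s-\omega)t}, & t < 0,\ \operatorname{Re} s > \omega, \\
   0, & (\operatorname{Re} s - \omega)\,t > 0,
   \end{cases}
\]

so that the Fourier transform of $(s - (\omega + i\eta))^{-1}$ is given by this expression.
Hence from Lemma 3 and (7, Theorem 81), if $\operatorname{Re} s > \omega$,

\[
\begin{aligned}
f(s) &= \frac{1}{2\pi} \int_{-\infty}^{\infty} \frac{f(\omega + i\eta)}{s - (\omega + i\eta)}\,d\eta
      = \frac{1}{(2\pi)^{1/2}} \int_{-\infty}^{0} e^{(s-\omega)t} F(-t)\,dt \\
     &= \frac{1}{(2\pi)^{1/2}} \int_0^{\infty} e^{-(s-\omega)t} F(t)\,dt
      = \int_0^{\infty} e^{-st}\varphi(t)\,dt,
\end{aligned}
\]

so that f is the Laplace transform of a function φ with $e^{-\omega t}\varphi(t) \in L_q(0, \infty)$.
Also from Lemma 4 and (7, Theorem 81), if $\operatorname{Re} s < \omega$,

\[
0 = \frac{1}{2\pi} \int_{-\infty}^{\infty} \frac{f(\omega + i\eta)}{s - (\omega + i\eta)}\,d\eta
  = -\frac{1}{(2\pi)^{1/2}} \int_0^{\infty} e^{(s-\omega)t} F(-t)\,dt
  = -\int_0^{\infty} e^{st}\varphi(-t)\,dt,
\]

that is, the Laplace transform of $\varphi(-t)$, with variable $-s$, vanishes. Hence
by (2, chapter 2, § 9, Theorem 4) $\varphi(-t) = 0$ a.e. for $t > 0$, or equivalently
$\varphi(t) = 0$ a.e. for $t < 0$.
Further, from (7, Theorem 79),

\[
\int_0^{\infty} e^{-q\omega t}\,|\varphi(t)|^q\,dt
 = (2\pi)^{-q/2} \int_{-\infty}^{\infty} |F(t)|^q\,dt
 \le \frac{K(q)}{(2\pi)^{2}} \int_{-\infty}^{\infty} |y|^{q-2}\,|f(\omega + iy)|^q\,dy. \tag{3.1}
\]

Now since $q \ge 2$, if $\omega < \omega' < x$ and $g \in \mathcal{H}_q(\omega)$, then $\nu_q(g; x, \omega') \le \nu_q(g; x, \omega)$,
so that $g \in \mathcal{H}_q(\omega')$. Hence if $x > \omega$, $f \in \mathcal{H}_q(x)$ so that by what we have
just proved there is a function $\varphi_x$, with $e^{-xt}\varphi_x(t) \in L_q(0, \infty)$, satisfying (3.1)
with ω replaced by x, such that

\[ f(s) = \int_0^{\infty} e^{-st}\varphi_x(t)\,dt, \qquad \operatorname{Re} s > x, \]

and so that for almost all t

\[
e^{xt}\,\operatorname{l.i.m.}^{(q)}_{R \to \infty} \frac{1}{2\pi} \int_{-R}^{R} e^{i\eta t} f(x + i\eta)\,d\eta
 = \begin{cases} \varphi_x(t), & t > 0, \\ 0, & t < 0. \end{cases}
\]

But by (2, chapter 2, § 9, Theorem 4), $\varphi_x(t) = \varphi(t)$ a.e. for $t > 0$. Hence for
any $x \ge \omega$ and almost all t

\[
e^{xt}\,\operatorname{l.i.m.}^{(q)}_{R \to \infty} \frac{1}{2\pi} \int_{-R}^{R} e^{i\eta t} f(x + i\eta)\,d\eta
 = \begin{cases} \varphi(t), & t > 0, \\ 0, & t < 0. \end{cases}
\]

Finally from (3.1), with ω replaced by x, we obtain, since $q \ge 2$,

\[
\int_0^{\infty} e^{-qxt}\,|\varphi(t)|^q\,dt
 = \int_0^{\infty} e^{-qxt}\,|\varphi_x(t)|^q\,dt
 \le \frac{K(q)}{(2\pi)^{2}} \int_{-\infty}^{\infty} |y|^{q-2}\,|f(x + iy)|^q\,dy
 \le K\,\nu_q(f; x, \omega),
\]

where $K = K(q)/2\pi$.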


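4. The spaces $\mathcal{H}_{p,\lambda}(\omega)$. Theorems 3 and 4 correspond to Theorems 1 and
2 of (4).


THEOREM 3. If $e^{-\omega t}\varphi(t) \in L_p(0, \infty)$, $1 < p \le 2$, $\lambda \ge 0$, and

\[ f(s) = \int_0^{\infty} e^{-st}\,t^{\lambda}\varphi(t)\,dt, \qquad \operatorname{Re} s > \omega, \]

then $f \in \mathcal{H}_{p,\lambda}(\omega)$.


Proof. If $\lambda = 0$ the statement reduces to that of Theorem 1. Hence we
may assume $\lambda > 0$. If $\omega' > \omega$, then since $t^{\lambda} e^{-(\omega' - \omega)t}$ is bounded for $t \ge 0$,
$e^{-\omega' t}\,t^{\lambda}\varphi(t) \in L_p(0, \infty)$, and hence by Theorem 1 $f \in \mathcal{H}_p(\omega')$, and if $x > \omega'$

\[ \nu_p(f; x, \omega') \le K \int_0^{\infty} e^{-pxt}\,t^{p\lambda}\,|\varphi(t)|^p\,dt. \]

Let $x > \omega$, and choose ω' so that $\omega < \omega' < x$. Then since $1 < p \le 2$,

\[ \nu_p(f; x, \omega) \le \nu_p(f; x, \omega') \le K \int_0^{\infty} e^{-pxt}\,t^{p\lambda}\,|\varphi(t)|^p\,dt. \]

Hence

\[
\begin{aligned}
\nu_{p,\lambda}(f; \omega)
 &= \int_{\omega}^{\infty} (x - \omega)^{p\lambda - 1}\,\nu_p(f; x, \omega)\,dx \\
 &\le K \int_{\omega}^{\infty} (x - \omega)^{p\lambda - 1}\,dx \int_0^{\infty} e^{-pxt}\,t^{p\lambda}\,|\varphi(t)|^p\,dt \\
 &= K \int_0^{\infty} t^{p\lambda}\,|\varphi(t)|^p\,dt \int_{\omega}^{\infty} (x - \omega)^{p\lambda - 1} e^{-pxt}\,dx
  = K\,\frac{\Gamma(p\lambda)}{p^{p\lambda}} \int_0^{\infty} e^{-p\omega t}\,|\varphi(t)|^p\,dt,
\end{aligned}
\]

and $f \in \mathcal{H}_{p,\lambda}(\omega)$.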
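
The inner integral in the last step of this proof is a Gamma integral. A short worked evaluation, assuming $t > 0$ and $p\lambda > 0$:

```latex
% Substitute u = x - \omega, then v = p t u.
\[
\int_{\omega}^{\infty} (x - \omega)^{p\lambda - 1} e^{-pxt}\,dx
  = e^{-p\omega t} \int_0^{\infty} u^{p\lambda - 1} e^{-ptu}\,du
  = e^{-p\omega t}\,\frac{\Gamma(p\lambda)}{(pt)^{p\lambda}},
\]
\[
\text{so that}\qquad
t^{p\lambda} \int_{\omega}^{\infty} (x - \omega)^{p\lambda - 1} e^{-pxt}\,dx
  = \frac{\Gamma(p\lambda)}{p^{p\lambda}}\,e^{-p\omega t}.
\]
```

The factor $t^{p\lambda}$ cancels exactly, which is why the exponent $p\lambda - 1$ appears in the definition (1.4).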


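THEOREM 4. If $f \in \mathcal{H}_{q,\lambda}(\omega)$, $q \ge 2$, $\lambda > 0$, then there is a function φ, with
$e^{-\omega t}\varphi(t) \in L_q(0, \infty)$, such that

\[ f(s) = \int_0^{\infty} e^{-st}\,t^{\lambda}\varphi(t)\,dt, \qquad \operatorname{Re} s > \omega. \]

Proof. Since $f \in \mathcal{H}_q(\omega')$ for every $\omega' > \omega$, by Theorem 2 if $\omega' > \omega$ there
is a function $\varphi_{\omega'}$, with

\[ e^{-\omega' t}\varphi_{\omega'}(t) \in L_q(0, \infty), \]

such that

\[ f(s) = \int_0^{\infty} e^{-st}\varphi_{\omega'}(t)\,dt, \qquad \operatorname{Re} s > \omega'. \]

But by (2, chapter 2, § 9, Theorem 4), if ω' and ω'' are larger than ω,
$\varphi_{\omega'}(t) = \varphi_{\omega''}(t)$ a.e. for $t > 0$. Hence if $\varphi_0$ is any one of these functions and
$\operatorname{Re} s > \omega$, then choosing ω' so that $\omega < \omega' < \operatorname{Re} s$ we obtain

\[ f(s) = \int_0^{\infty} e^{-st}\varphi_{\omega'}(t)\,dt = \int_0^{\infty} e^{-st}\varphi_0(t)\,dt. \]

Also from Theorem 2, since $q \ge 2$, if $x > \omega$ and ω' is chosen so that $\omega < \omega' < x$

\[
\int_0^{\infty} e^{-qxt}\,|\varphi_0(t)|^q\,dt
 = \int_0^{\infty} e^{-qxt}\,|\varphi_{\omega'}(t)|^q\,dt
 \le K\,\nu_q(f; x, \omega') \le K\,\nu_q(f; x, \omega).
\]

Hence, if we multiply this inequality by $(x - \omega)^{q\lambda - 1}$ and integrate from
ω to ∞, we obtain

\[
\int_{\omega}^{\infty} (x - \omega)^{q\lambda - 1}\,dx \int_0^{\infty} e^{-qxt}\,|\varphi_0(t)|^q\,dt
 \le K \int_{\omega}^{\infty} (x - \omega)^{q\lambda - 1}\,\nu_q(f; x, \omega)\,dx
 = K\,\nu_{q,\lambda}(f; \omega).
\]

But the integral on the left-hand side of this inequality is equal to

\[
\int_0^{\infty} |\varphi_0(t)|^q\,dt \int_{\omega}^{\infty} (x - \omega)^{q\lambda - 1} e^{-qxt}\,dx
 = \frac{\Gamma(q\lambda)}{q^{q\lambda}} \int_0^{\infty} e^{-q\omega t}\,t^{-q\lambda}\,|\varphi_0(t)|^q\,dt,
\]

so that

\[
\int_0^{\infty} e^{-q\omega t}\,t^{-q\lambda}\,|\varphi_0(t)|^q\,dt
 \le \frac{q^{q\lambda}}{\Gamma(q\lambda)}\,K\,\nu_{q,\lambda}(f; \omega) < \infty.
\]

Hence if we let $\varphi(t) = t^{-\lambda}\varphi_0(t)$, then $e^{-\omega t}\varphi(t) \in L_q(0, \infty)$, and if $\operatorname{Re} s > \omega$

\[ f(s) = \int_0^{\infty} e^{-st}\,t^{\lambda}\varphi(t)\,dt. \]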


REFERENCES 

1. G. Doetsch, Bedingungen für die Darstellbarkeit einer Funktion als Laplace-Integral und eine
Umkehrformel für die Laplace-Transformation, Math. Zeit., 42 (1937), 263-86.

2. G. Doetsch, Handbuch der Laplace-Transformation. I (Basel, 1950).

3. G. H. Hardy and J. E. Littlewood, Some new properties of Fourier constants, Math. Annal.,
97 (1926), 159-209.

4. P. G. Rooney, On some theorems of Doetsch, Can. J. Math., 10 (1958), 421-30.

5. E. C. Titchmarsh, A contribution to the theory of Fourier transforms, Proc. Lond. Math.
Soc. (2), 23 (1923), 279-89.

6. E. C. Titchmarsh, The theory of functions (2nd ed.; Oxford, 1939).

7. E. C. Titchmarsh, Introduction to the theory of Fourier integrals (2nd ed.; Oxford, 1948).





University of Toronto 











A NETWORK-FLOW FEASIBILITY THEOREM 
AND COMBINATORIAL APPLICATIONS 


D. R. FULKERSON 


1. Introduction. There are a number of interesting theorems, relative to 
capacitated networks, that give necessary and sufficient conditions for the 
existence of flows satisfying constraints of various kinds. Typical of these are 
the supply-demand theorem due to Gale (4), which states a condition for the 
existence of a flow satisfying demands at certain nodes from supplies at other 
nodes, and the Hoffman circulation theorem (received by the present author 
in private communication), which states a condition for the existence of a 
circulatory flow in a network in which each arc has associated with it not 
only an upper bound for the arc flow, but a lower bound as well. If the con- 
straints on flows are integral (for example, if the bounds on arc flows for the 
circulation theorem are integers), it is also true that integral flows meeting 
the requirements exist provided any flow does so. This fact has been used 
by Gale (4), and by Ford and Fulkerson (3), in the solution of several com- 
binatorial problems. For example, Gale has shown how the supply-demand 
theorem, together with the existence of integral flows, can be used to derive 
simple conditions for the existence of a matrix of zeros and ones having 
prescribed row and column sums, a problem that was also solved independently 
by Ryser (9) by means of purely combinatorial methods. 

The present paper adds some results along the lines we have described. We 
first establish a feasibility theorem, which may be described informally as 
follows. Suppose there is given a capacitated network with certain of the nodes 
designated as sources, others as sinks, and assume that each source is required 
to send, and each sink to receive, an amount that lies between prescribed 
bounds. Under what conditions is this possible? The theorem asserts that if 
(a) there is a flow that sends out of each source an amount at least as great 
as the lower bound for the source, and into each sink no more than the upper 
bound for the sink, and if (b) there is a flow that sends out of each source no 
more than the upper bound for the source, and into each sink at least as much 
as the lower bound for the sink, then there is a flow that meets all the require- 
ments simultaneously. We do not give a direct proof of this theorem, but 
rather use the max-flow min-cut theorem (1; 2) to find a pair of conditions 
that are necessary and sufficient for the existence of the required flow, and 
then observe that one of the conditions is equivalent to (a) above, the other 


to (b). 


Received July 28, 1958. 


Our first combinatorial application (§ 5) of the feasibility theorem is to 
generalize the Gale-Ryser theorem on incidence matrices having prescribed 
row and column sums, to the extent of allowing these sums to vary within 
designated bounds. 

Our second application (§ 6) concerns the subgraph problem for directed 
graphs: to find necessary and sufficient conditions that a finite directed graph 
G have a subgraph H possessing specified local degrees. A solution to this 
problem has been given by Ore (6). Here, in keeping with the feasibility
theorem, we extend the problem by permitting the number of arcs of H that 
enter or leave each node of G to vary within bounds, and then show that 
the conditions obtained for this latter problem reduce to Ore’s conditions for 
the subgraph problem. 

The similar problem for undirected graphs, which has been solved by Tutte 
(12), and also by Ore (8), is, so far as we know, amenable to network-flow
methods only in the special case that G is an even graph, and this because 
the problem then is, in essence, a directed one. 

Our final application (§ 7) deals with a problem involving set representatives: 
to find necessary and sufficient conditions for the existence of a system of 
distinct representatives having the further property that the intersection of 
the system with each member of a given partition of the fundamental set has 
a cardinality lying between assigned bounds. This problem was first posed 
and solved by Hoffman and Kuhn (5); it is shown here that the conditions 
established in (5) are deducible from the feasibility theorem. 


2. Definitions, notation, and prior results. Let G be a finite directed
network or linear graph consisting of a set N of nodes, x, y, ..., and directed
arcs joining pairs of nodes, the arc from x to y being denoted (x, y), and
suppose that each arc (x, y) has associated with it a capacity c(x, y), where
c(x, y) is either a non-negative real number or plus infinity. Let the set
of nodes be partitioned into three subsets: S (the set of sources), T (the set
of sinks), and R (the set of intermediate nodes). We call a real-valued function
f defined on the arcs of G a flow from S to T provided that

\[ \sum_{y \in A(x)} f(x, y) - \sum_{y \in B(x)} f(y, x) = 0, \qquad x \in R, \tag{1} \]
\[ 0 \le f(x, y) \le c(x, y), \qquad \text{all } (x, y), \tag{2} \]

where A(x) ("after" x) is the set of nodes y such that (x, y) is an arc, and
B(x) ("before" x) consists of those nodes y such that (y, x) is an arc. Thus
(1) states that the flow out of an intermediate node is equal to the flow in,
and (2) that the flow in each arc does not exceed its capacity.

We will be interested in flows from S to T that satisfy bounds on the net
flow leaving each $x \in S$, and entering each $x \in T$. Thus, for $x \in S$, let α(x)
and β(x) be real-valued functions with

\[ 0 \le \alpha(x) \le \beta(x); \]

similarly, associate with each $x \in T$ two real numbers a(x) and b(x), where

\[ 0 \le a(x) \le b(x). \]

The additional constraints

\[ \alpha(x) \le \sum_{y \in A(x)} f(x, y) - \sum_{y \in B(x)} f(y, x) \le \beta(x), \qquad x \in S, \tag{3a} \]
\[ a(x) \le \sum_{y \in B(x)} f(y, x) - \sum_{y \in A(x)} f(x, y) \le b(x), \qquad x \in T, \tag{3b} \]

will be termed feasible provided there is a flow f from S to T satisfying them.
In this case, f will also be called a feasible flow.

To simplify the notation, we adopt the following conventions. If X and Y 
are subsets of N, denote by (X, Y) the set of arcs leading from X to Y; and 
for any function f defined on the arcs, let 


$\sum_{(x, y) \in (X, Y)} f(x, y) = f(X, Y).$


Similarly, if a is defined on a subset X of N, let 


$\sum_{x \in X} a(x) = a(X).$
We shall also use A(X) to denote the set of all nodes y such that (x, y), for 
some x € X, is an arc of G, and similarly for B(X). 
The value v(f) of a flow f from S to T is the net flow leaving the sources, 
which, in the notation just introduced, is given by 


(4)  $v(f) = f(S, A(S)) - f(B(S), S).$
In view of (1), v(f) may also be expressed as the net flow entering the sinks: 
(5)  $v(f) = f(B(T), T) - f(T, A(T)).$


Let $X$, $\bar{X}$ be a partition of N with $S \subseteq X$, $T \subseteq \bar{X}$. The set of arcs $(X, \bar{X})$ 
is a cut in G (separating S and T), and $c(X, \bar{X})$ is the cut capacity. 

A fundamental theorem concerning flows from S to T in a network G 
asserts that the maximal flow value is equal to the minimal cut capacity 
(1, 2). A second theorem, important for combinatorial applications, is that 
if the capacity function c assumes only integral values, then there exists a 
maximal flow f that is likewise integral (2, 3). 

Gale (4) has used the max-flow min-cut theorem to prove that if $\alpha(x) = 0$, 
$b(x) = \infty$ in (3), then a feasible flow (that is, a flow satisfying the "demands" 
$a(x)$ at the sinks from the "supplies" $\beta(x)$ at the sources) exists if and only 
if, for every partition $X$, $\bar{X}$ of N, we have 


(6)  $a(T \cdot \bar{X}) \le c(X, \bar{X}) + \beta(S \cdot \bar{X}),$


where $X \cdot Y$ denotes the intersection of the sets X and Y. 
















3. Feasibility theorems. In this section, we develop a generalization 
of the supply-demand feasibility theorem by finding conditions under which 
the full set of constraints (3) is feasible. 

We begin by adjoining to the given network G four new nodes, s, t, u, v, 
and several sets of arcs, as follows: 


(s, S), (u, S), (T, t), (T, v), (u, t), (s, v), (t, s). 


Next, we extend the capacity function c defined on arcs of G to the new 
network G* by 


$c(s, x) = \beta(x) - \alpha(x), \quad x \in S,$
$c(u, x) = \alpha(x), \quad x \in S,$
$c(x, t) = b(x) - a(x), \quad x \in T,$
$c(x, v) = a(x), \quad x \in T,$
$c(u, t) = a(T),$
$c(s, v) = \alpha(S),$
$c(t, s) = \infty.$


We assert that a feasible flow exists in G if, and only if, the value of a maximal 
flow from u to v in G* is $\alpha(S) + a(T)$. Suppose first that f is feasible in G; 
extend f to f*, defined on the arcs of G*, as follows: 


$f^*(s, x) = f(x, A(x)) - f(B(x), x) - \alpha(x), \quad x \in S,$
$f^*(u, x) = \alpha(x), \quad x \in S,$
$f^*(x, t) = f(B(x), x) - f(x, A(x)) - a(x), \quad x \in T,$
$f^*(x, v) = a(x), \quad x \in T,$
$f^*(u, t) = a(T),$
$f^*(s, v) = \alpha(S),$
$f^*(t, s) = f(S, A(S)) - f(B(S), S),$
$f^*(x, y) = f(x, y), \quad \text{for arcs } (x, y) \text{ of } G.$


It is a routine matter to check that f* is a flow from u to v in G*. Clearly, f* 
has value 


$v(f^*) = \alpha(S) + a(T).$
Conversely, let f* be a flow from u to v in G*, of value $\alpha(S) + a(T)$. Then 


$f^*(u, x) = \alpha(x), \quad x \in S,$
$f^*(x, v) = a(x), \quad x \in T.$


Let f be f* restricted to G. Then f is a flow from S to T in G, and it remains 
only to show that f is feasible. Consider any $x \in S$. From (1) applied to x, 
we have 


$f^*(u, x) + f^*(s, x) = f(x, A(x)) - f(B(x), x),$


or 


$\alpha(x) + f^*(s, x) = f(x, A(x)) - f(B(x), x);$










and, since 
$0 \le f^*(s, x) \le \beta(x) - \alpha(x),$
we get 
$\alpha(x) \le f(x, A(x)) - f(B(x), x) \le \beta(x),$


which is (3a). Inequalities (3b) are similarly proved. This completes the 
proof of the assertion. 
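
The construction of G* lends itself to a direct computational check. The sketch below, under stated assumptions, builds the enlarged network and tests whether a maximal flow from u to v saturates all arcs at u and v. The node labels "s*", "t*", "u*", "v*", the function names, and the use of a breadth-first augmenting-path routine for the maximal flow are choices made for this illustration; the paper itself only invokes the max-flow min-cut theorem. Integer bounds are assumed so that the final equality test is exact.

```python
from collections import deque
from math import inf

def max_flow(cap, src, dst):
    """Value of a maximal flow, by shortest augmenting paths (BFS)."""
    res = {}
    for x, nbrs in cap.items():
        for y, c in nbrs.items():
            res.setdefault(x, {})
            res.setdefault(y, {})
            res[x][y] = res[x].get(y, 0) + c
            res[y].setdefault(x, 0)          # reverse residual arc
    total = 0
    while True:
        parent = {src: None}
        queue = deque([src])
        while queue and dst not in parent:
            x = queue.popleft()
            for y, r in res.get(x, {}).items():
                if r > 0 and y not in parent:
                    parent[y] = x
                    queue.append(y)
        if dst not in parent:
            return total
        push, y = inf, dst                   # bottleneck along the path
        while parent[y] is not None:
            push = min(push, res[parent[y]][y])
            y = parent[y]
        y = dst
        while parent[y] is not None:         # update residual capacities
            res[parent[y]][y] -= push
            res[y][parent[y]] += push
            y = parent[y]
        total += push

def feasible(arcs, S, T, lo_src, hi_src, lo_snk, hi_snk):
    """arcs: dict (x, y) -> capacity in G; bounds are dicts keyed by node."""
    cap = {}
    def add(x, y, c):
        cap.setdefault(x, {})[y] = c
    for (x, y), c in arcs.items():
        add(x, y, c)
    for x in S:
        add("s*", x, hi_src[x] - lo_src[x])
        add("u*", x, lo_src[x])
    for x in T:
        add(x, "t*", hi_snk[x] - lo_snk[x])
        add(x, "v*", lo_snk[x])
    need_src = sum(lo_src[x] for x in S)
    need_snk = sum(lo_snk[x] for x in T)
    add("u*", "t*", need_snk)
    add("s*", "v*", need_src)
    add("t*", "s*", inf)
    return max_flow(cap, "u*", "v*") == need_src + need_snk

# Hypothetical one-arc network: source x, sink y, capacity 3.
arcs = {("x", "y"): 3}
print(feasible(arcs, ["x"], ["y"], {"x": 1}, {"x": 2}, {"y": 2}, {"y": 4}))
print(feasible(arcs, ["x"], ["y"], {"x": 1}, {"x": 2}, {"y": 3}, {"y": 4}))
```

In the second call the sink demands at least three units while the source may release at most two, so the test fails.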

We may, therefore, in searching for feasibility criteria, rephrase the question 
as follows. Under what conditions does there exist a flow f* from u to v in G* 
having value $v(f^*) = \alpha(S) + a(T)$, that is, saturating all source and sink 
arcs? The max-flow min-cut theorem can now be used to provide an answer 
to this question by insisting that the capacities of all cuts separating u 
and v be at least as great as $\alpha(S) + a(T)$. 

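Thus, let $(X^*, \bar{X}^*)$ be a cut in G*, and consider cases. 

Case 1. $s \in X^*$, $t \in \bar{X}^*$. Partition $X^*$, $\bar{X}^*$ as follows: $X^* = u + s + X$, 
$\bar{X}^* = v + t + \bar{X}$. Then 


$c(X^*, \bar{X}^*) = c(u, t) + c(u, \bar{X}) + c(s, v) + c(s, \bar{X}) + c(X, v) + c(X, t) + c(X, \bar{X})$
$\quad = a(T) + \alpha(S \cdot \bar{X}) + \alpha(S) + \beta(S \cdot \bar{X}) - \alpha(S \cdot \bar{X}) + a(T \cdot X) + b(T \cdot X) - a(T \cdot X) + c(X, \bar{X}).$
Hence, in this case, we always have $c(X^*, \bar{X}^*) \ge \alpha(S) + a(T)$. 


Case 2. $s \in \bar{X}^*$, $t \in X^*$. Then $c(X^*, \bar{X}^*)$ is infinite. Hence again no 
condition is obtained. 


Case 3. $s \in X^*$, $t \in X^*$. Letting $X^* = s + t + u + X$, $\bar{X}^* = v + \bar{X}$, we 
have 


$c(X^*, \bar{X}^*) = c(s, v) + c(s, \bar{X}) + c(u, \bar{X}) + c(X, v) + c(X, \bar{X})$
$\quad = \alpha(S) + \beta(S \cdot \bar{X}) - \alpha(S \cdot \bar{X}) + \alpha(S \cdot \bar{X}) + a(T \cdot X) + c(X, \bar{X}).$


Thus $c(X^*, \bar{X}^*) \ge \alpha(S) + a(T)$ if, and only if, 
(7)  $\beta(S \cdot \bar{X}) + c(X, \bar{X}) \ge a(T \cdot \bar{X}).$
Case 4. $s \in \bar{X}^*$, $t \in \bar{X}^*$. Let $X^* = u + X$, $\bar{X}^* = s + t + v + \bar{X}$. Then 


$c(X^*, \bar{X}^*) = c(u, t) + c(u, \bar{X}) + c(X, t) + c(X, v) + c(X, \bar{X})$
$\quad = a(T) + \alpha(S \cdot \bar{X}) + b(T \cdot X) - a(T \cdot X) + a(T \cdot X) + c(X, \bar{X}),$


and we obtain the condition 
(8)  $b(T \cdot X) + c(X, \bar{X}) \ge \alpha(S \cdot X).$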


We may therefore state the following result. 







THEOREM 1. The constraints (3) are feasible if and only if (7) and (8) hold 
for all partitions $X$, $\bar{X}$ of N. 
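
For small networks the two cut conditions of Theorem 1 can be tested by enumerating every partition of N. The sketch below is a brute-force illustration only: the function name, the argument layout, and the parameter names (`lo_src`, `hi_src` for the lower and upper bounds at the sources, `lo_snk`, `hi_snk` for those at the sinks) are assumptions of this example, and the enumeration is exponential in the number of nodes.

```python
from itertools import combinations

def cut_conditions_hold(nodes, arcs, S, T, lo_src, hi_src, lo_snk, hi_snk):
    """Check conditions (7) and (8) for every partition X, complement of X.

    nodes: iterable of all nodes of G; arcs: dict (x, y) -> capacity;
    S, T: sets of sources and sinks; bounds are dicts keyed by node.
    """
    nodes = list(nodes)
    S, T = set(S), set(T)
    for r in range(len(nodes) + 1):
        for chosen in combinations(nodes, r):
            X = set(chosen)
            Xbar = set(nodes) - X
            # Capacity of the cut: arcs leading from X to its complement.
            cut = sum(c for (x, y), c in arcs.items() if x in X and y in Xbar)
            # (7): upper source bounds in the complement plus cut capacity
            #      must cover the lower sink bounds in the complement.
            if sum(hi_src[x] for x in S & Xbar) + cut < sum(lo_snk[x] for x in T & Xbar):
                return False
            # (8): upper sink bounds in X plus cut capacity must cover
            #      the lower source bounds in X.
            if sum(hi_snk[x] for x in T & X) + cut < sum(lo_src[x] for x in S & X):
                return False
    return True

# Hypothetical one-arc network: source x, sink y, capacity 3.
arcs = {("x", "y"): 3}
print(cut_conditions_hold("xy", arcs, {"x"}, {"y"}, {"x": 1}, {"x": 2}, {"y": 2}, {"y": 4}))
print(cut_conditions_hold("xy", arcs, {"x"}, {"y"}, {"x": 1}, {"x": 2}, {"y": 3}, {"y": 4}))
```

In the second call the empty set X already violates (7): the source can supply at most 2 while the sink demands at least 3.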


Notice that (7) is precisely condition (6) for the supply-demand case; that 
is, if $\alpha(x) = 0$ for $x \in S$, and $b(x) = \infty$ for $x \in T$, then Theorem 1 reduces 
to the supply-demand theorem of (4). Condition (8) may be interpreted as 
follows. If we interchange sources and sinks in G, reverse all arc directions, 
and think of $\alpha$ as the demand function at the set S of sinks, b as the supply 
function at the set T of sources, then (8) is a necessary and sufficient 
condition for feasibility of the supplies and demands in the reversed network. 
Thus Theorem 1 may be restated as follows. 


THEOREM 2. The constraints (3a) and (3b) are jointly feasible if, and only 
if, the constraints 


(9)  $\begin{cases} \alpha(x) \le f(x, A(x)) - f(B(x), x), & x \in S, \\ f(B(x), x) - f(x, A(x)) \le b(x), & x \in T, \end{cases}$
and 

(10)  $\begin{cases} f(x, A(x)) - f(B(x), x) \le \beta(x), & x \in S, \\ a(x) \le f(B(x), x) - f(x, A(x)), & x \in T, \end{cases}$


are separately feasible. 


Theorem 2 is the formulation described verbally in the Introduction. One 
suspects that there should be a simple method of constructing a flow satisfying 
all the constraints from the two separate flows, but we have not found such 
a method. 

We note one other fact for the combinatorial applications. Namely, if the 
functions $\alpha$, $\beta$, a, b, and c are integral-valued, and if the constraints (3) are 
feasible, then there is an integral feasible flow f. This follows directly from the 
proof of Theorem 1 and the existence of integral maximal flows in networks 
having integral capacities. 


4. Application to matrices. When the network G is suitably specialized, 
Theorem 2 (or Theorem 1) provides criteria for the existence of a non-negative 
matrix whose row and column sums lie between designated limits, or, more 
generally, for the existence of a matrix with this property and the further 
property that the elements of the matrix are bounded above by specified 
numbers. We state the criteria provided by Theorem 2 explicitly as follows: 


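THEOREM 3. Let $0 \le \alpha_i \le \beta_i$, $i = 1, \dots, m$, $0 \le a_j \le b_j$, $j = 1, \dots, n$, 
and $c_{ij} \ge 0$ be given constants. If there are matrices $f'_{ij}$, $f''_{ij}$ satisfying 


(11)  $\alpha_i \le \sum_j f'_{ij}, \quad \sum_i f'_{ij} \le b_j, \quad 0 \le f'_{ij} \le c_{ij},$


(12)  $\sum_j f''_{ij} \le \beta_i, \quad a_j \le \sum_i f''_{ij}, \quad 0 \le f''_{ij} \le c_{ij},$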












then there is a matrix $f_{ij}$ satisfying 
(13)  $\alpha_i \le \sum_j f_{ij} \le \beta_i, \quad a_j \le \sum_i f_{ij} \le b_j, \quad 0 \le f_{ij} \le c_{ij}.$


To prove Theorem 3, take G to be the network consisting of nodes $x_i$ 
($i = 1, \dots, m$), $y_j$ ($j = 1, \dots, n$), and arcs $(x_i, y_j)$ of capacity $c_{ij}$. Let 
$S = \{x_1, \dots, x_m\}$, $T = \{y_1, \dots, y_n\}$, so that R is vacuous. Associate with 
each source $x_i$ the bounds $\alpha_i$, $\beta_i$, and with each sink $y_j$ the bounds $a_j$, $b_j$. 
Then a flow from S to T is a matrix $f_{ij}$ satisfying $0 \le f_{ij} \le c_{ij}$; a feasible 
flow satisfies, in addition, the first two inequalities of (13). Thus Theorem 3 
is a direct consequence of Theorem 2. 


5. Incidence matrices. Gale (4) and Ryser (9) have found simple 
conditions for the existence of a matrix of zeros and ones having prescribed 
row and column sums—or, what is the same thing, for the existence of an 
incidence matrix whose row sums are bounded below by given integers and 
whose column sums are bounded above by given integers. 

The following is one interpretation of their problem. Suppose there is given 
a finite set $E = \{e_1, \dots, e_m\}$. Under what conditions on the sets of integers 
$\{\alpha_1, \dots, \alpha_m\}$ and $\{b_1, \dots, b_n\}$ is it possible to construct n subsets $E_1, \dots, E_n$ 
of E such that (a) the number of sets $E_j$ that contain the element $e_i$ is at 
least $\alpha_i$, and (b) the set $E_j$ contains at most $b_j$ elements? 

The conditions are surprisingly simple. Arrange the $\alpha$'s in decreasing order, 


$\alpha_{i_1} \ge \alpha_{i_2} \ge \dots \ge \alpha_{i_m},$


and define $\sigma_k$ to be the number of integers in the set of b's that are greater 
than or equal to k. Then the required incidence matrix exists if, and only if, 
we have 


(14)  $\sum_{k=1}^{l} \alpha_{i_k} \le \sum_{k=1}^{l} \sigma_k, \quad l = 1, 2, \dots,$


where we take $\alpha_{i_k} = 0$ for $k > m$. 
As a corollary of the Gale-Ryser condition (14), Theorem 3 with all $c_{ij} = 1$, 
and the remark at the end of §3, we have the following result: 


THEOREM 4. There exists a matrix of zeros and ones for which the ith row 
sum lies between given non-negative integers $\alpha_i$ and $\beta_i$, and the jth column sum 
lies between given non-negative integers $a_j$ and $b_j$, where $\alpha_i \le \beta_i$, $a_j \le b_j$, if, 
and only if, 


(15)  $\sum_{k=1}^{l} \alpha_{i_k} \le \sum_{k=1}^{l} \sigma_k, \quad l = 1, 2, \dots,$

(16)  $\sum_{k=1}^{l} a_{j_k} \le \sum_{k=1}^{l} \tau_k, \quad l = 1, 2, \dots,$


where 

























$\alpha_{i_1} \ge \alpha_{i_2} \ge \dots \ge \alpha_{i_m}, \quad a_{j_1} \ge a_{j_2} \ge \dots \ge a_{j_n},$


and $\sigma_k$ is the number of b's, and $\tau_k$ the number of $\beta$'s, that are greater than or 
equal to k. 
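
The two families of inequalities in Theorem 4 are easy to evaluate mechanically. The following sketch, under stated assumptions, computes both sides of (15) and (16) and, for tiny matrices, compares the verdict with an exhaustive search. The function names and the argument order (row lower bounds, row upper bounds, column lower bounds, column upper bounds) are choices made for this illustration; only l up to the number of rows (respectively columns) is examined, since larger l adds nothing new.

```python
from itertools import product

def count_at_least(values, k_max):
    """For k = 1..k_max, the number of entries of values that are >= k."""
    return [sum(1 for v in values if v >= k) for k in range(1, k_max + 1)]

def theorem4_holds(row_lo, row_hi, col_lo, col_hi):
    m, n = len(row_lo), len(col_lo)
    sigma = count_at_least(col_hi, m)        # counts column upper bounds >= k
    tau = count_at_least(row_hi, n)          # counts row upper bounds >= k
    rows = sorted(row_lo, reverse=True)      # row lower bounds, decreasing
    cols = sorted(col_lo, reverse=True)      # column lower bounds, decreasing
    cond15 = all(sum(rows[:l]) <= sum(sigma[:l]) for l in range(1, m + 1))
    cond16 = all(sum(cols[:l]) <= sum(tau[:l]) for l in range(1, n + 1))
    return cond15 and cond16

def exists_matrix(row_lo, row_hi, col_lo, col_hi):
    """Exhaustive search over all zero-one matrices (tiny sizes only)."""
    m, n = len(row_lo), len(col_lo)
    for bits in product((0, 1), repeat=m * n):
        rows = [sum(bits[i * n:(i + 1) * n]) for i in range(m)]
        cols = [sum(bits[j::n]) for j in range(n)]
        if all(row_lo[i] <= rows[i] <= row_hi[i] for i in range(m)) and \
           all(col_lo[j] <= cols[j] <= col_hi[j] for j in range(n)):
            return True
    return False

# Rows must sum to exactly 2 each while columns may sum to at most 1 each.
print(theorem4_holds([2, 2], [2, 2], [0, 0], [1, 1]))
# One row of two ones and one empty row meets these bounds.
print(theorem4_holds([2, 0], [2, 2], [1, 1], [1, 1]))
```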


6. The subgraph problem. Let G be a finite directed graph, and let 
e(x) and i(x) be, respectively, the number of arcs entering and the number 
of arcs issuing from node x. Then the (local) degree of G at x is defined to 
be the pair e(x), i(x). 

The subgraph problem is the problem of determining conditions under 
which G has a subgraph H having prescribed local degrees. We consider the 
following generalization of this problem. Associate with each node $x \in N$ 
four integers $a(x)$, $b(x)$, $\alpha(x)$, $\beta(x)$, satisfying 


(17a)  $0 \le a(x) \le b(x),$
(17b)  $0 \le \alpha(x) \le \beta(x),$


and determine conditions under which G has a subgraph H with local degrees 
$e_H(x)$, $i_H(x)$ satisfying 


(18a)  $a(x) \le e_H(x) \le b(x),$
(18b)  $\alpha(x) \le i_H(x) \le \beta(x).$


To find such conditions, we convert the problem to a flow problem and 
apply Theorem 1. First construct from G a new directed graph G' having 
twice as many nodes as G but the same number of arcs: to each node x of G 
correspond two nodes x', x'' of G'; if (x, y) is an arc of G, then (x', y'') is an 
arc of G' and these are all the arcs of G'. Assign unit capacity to each arc of 
G'. In G', let S and T be the set of primed and double primed nodes, respectively. 
Next impose, for each $x' \in S$, the condition (3a) that the flow out of x' lie 
between $\alpha(x)$ and $\beta(x)$; similarly, for $x'' \in T$, insist that the flow into x'' 
lie between $a(x)$ and $b(x)$. 

It is clear that an integral feasible flow f from S to T in G' singles out a 
subgraph H of G satisfying (18) simply by putting (x, y) in H if and only 
if $f(x', y'') = 1$. Conversely, of course, a subgraph H satisfying (18) produces 
an integral feasible flow in G'. Hence, if we let U, V be arbitrary subsets of 
S, T, respectively, and denote their respective complements in S, T by $\bar{U}$, $\bar{V}$, 
it follows from Theorem 1 and the existence of integral feasible flows that H 
exists if, and only if, 


(19a)  $\beta(\bar{U}) + |(U, \bar{V})| \ge a(\bar{V}),$
(19b)  $b(V) + |(U, \bar{V})| \ge \alpha(U), \quad \text{all } U \subseteq S,\ V \subseteq T,$
where | | denotes cardinality. 


Before proceeding further, let us consider inequalities (19) in the special 
case for which $a(x) = b(x)$, $\alpha(x) = \beta(x)$, that is, in the case for which the 
local degrees of H are specified exactly. Then a necessary condition for H 
to exist is that $\alpha(N) = b(N)$, or, in G', 
(20)  $\alpha(S) = b(T).$
On the other hand, (20) and (19b) now imply (19a), since 

$\alpha(\bar{U}) + |(U, \bar{V})| \ge \alpha(\bar{U}) + \alpha(U) - b(V) = \alpha(S) - b(V) \ge b(T) - b(V) = b(\bar{V}),$

which is (19a) with $\alpha = \beta$, $a = b$. 


Thus, (20) and (19b) are necessary and sufficient for the existence of a 
subgraph H having local degrees $e_H(x) = b(x)$, $i_H(x) = \alpha(x)$. 

Each of the conditions (19a), (19b) is stated in terms of selections of pairs 
of sets. Each can, however, be simplified to a condition involving the choice 
of but one set. Consider (19b), for example. For given $U \subseteq S$, let 

$V = \{y'' \in T \mid b(y'') \le |(U, y'')|\}.$


For this pair U, V, the left-hand side of (19b) may be written as 


$\sum_{y'' \in A(U)} \min\,[b(y''), |(U, y'')|].$


On the other hand, for fixed $U \subseteq S$, this sum clearly minimizes $b(V) + |(U, \bar{V})|$ 
over all $V \subseteq T$. Thus inequalities (19b) are equivalent to the inequalities 


(21)  $\sum_{y'' \in A(U)} \min\,[b(y''), |(U, y'')|] \ge \alpha(U), \quad \text{all } U \subseteq S.$


Similarly, (19a) reduces to 
(22)  $\sum_{y' \in B(\bar{V})} \min\,[\beta(y'), |(y', \bar{V})|] \ge a(\bar{V}), \quad \text{all } \bar{V} \subseteq T.$
Thus, translating (21) and (22) to conditions stated in terms of the given 
graph G, we have the following theorems: 


THEOREM 5. Let G be a finite directed graph with node set N, and suppose 
that, corresponding to each $x \in N$, there are integers $a(x)$, $b(x)$, $\alpha(x)$, $\beta(x)$ with 


$0 \le a(x) \le b(x),$
$0 \le \alpha(x) \le \beta(x).$


Then G has a subgraph H whose local degrees $e_H(x)$, $i_H(x)$ satisfy 


$a(x) \le e_H(x) \le b(x),$
$\alpha(x) \le i_H(x) \le \beta(x),$


if, and only if, for all $X \subseteq N$, we have 


(23)  $\alpha(X) \le \sum_{y \in A(X)} \min\,[b(y), |(X, y)|],$
(24)  $a(X) \le \sum_{y \in B(X)} \min\,[\beta(y), |(y, X)|].$
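
Conditions (23) and (24) involve only one set X at a time, so for a small graph they can be checked by enumerating subsets of N. The sketch below does this and, as a cross-check, also searches all arc subsets for a subgraph with admissible local degrees. All names are assumptions of this example: `in_lo`, `in_hi` stand for the lower and upper bounds on the number of entering arcs, and `out_lo`, `out_hi` for those on the number of issuing arcs. Summing over every node y rather than only over neighbours of X changes nothing, because nodes not adjacent to X contribute zero.

```python
from itertools import chain, combinations

def subsets(items):
    items = list(items)
    return chain.from_iterable(combinations(items, r) for r in range(len(items) + 1))

def degree_conditions_hold(nodes, arcs, in_lo, in_hi, out_lo, out_hi):
    for X in map(set, subsets(nodes)):
        # (23): issuing-arc demand of X against what the heads can absorb.
        rhs23 = sum(min(in_hi[y], sum(1 for (x, z) in arcs if x in X and z == y))
                    for y in nodes)
        # (24): entering-arc demand of X against what the tails can supply.
        rhs24 = sum(min(out_hi[y], sum(1 for (z, x) in arcs if z == y and x in X))
                    for y in nodes)
        if sum(out_lo[x] for x in X) > rhs23 or sum(in_lo[x] for x in X) > rhs24:
            return False
    return True

def subgraph_exists(nodes, arcs, in_lo, in_hi, out_lo, out_hi):
    """Exhaustive search over all arc subsets (tiny graphs only)."""
    for H in subsets(arcs):
        if all(in_lo[x] <= sum(1 for (_, y) in H if y == x) <= in_hi[x]
               and out_lo[x] <= sum(1 for (y, _) in H if y == x) <= out_hi[x]
               for x in nodes):
            return True
    return False

# Hypothetical graph: a directed 3-cycle plus one chord.
nodes = [1, 2, 3]
arcs = [(1, 2), (2, 3), (3, 1), (1, 3)]
one = {x: 1 for x in nodes}
print(degree_conditions_hold(nodes, arcs, one, one, one, one))
print(subgraph_exists(nodes, arcs, one, one, one, one))
```

With every local degree prescribed to be exactly one, the 3-cycle itself is the required subgraph.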










THEOREM 6 (Ore). The finite directed graph G has a subgraph H with local 
degrees 


$e_H(x) = b(x) \ge 0,$
$i_H(x) = \alpha(x) \ge 0,$
if, and only if, 
(25)  $\alpha(N) = b(N)$
and, for all $X \subseteq N$, 
(26)  $\alpha(X) \le \sum_{y \in A(X)} \min\,[b(y), |(X, y)|].$


As a consequence of Theorem 2, we may also state the following result: 
THEOREM 7. If the finite directed graph G has subgraphs $H_1$, $H_2$ such that 


$a(x) \le e_{H_1}(x), \quad i_{H_1}(x) \le \beta(x),$


$e_{H_2}(x) \le b(x), \quad \alpha(x) \le i_{H_2}(x),$
where $0 \le a(x) \le b(x)$, $0 \le \alpha(x) \le \beta(x)$, then G has a subgraph H such that 
$a(x) \le e_H(x) \le b(x), \quad \alpha(x) \le i_H(x) \le \beta(x).$


For undirected graphs G, the (local) degree of G at x is the number of arcs 
incident with x, and the subgraph problem is to determine conditions under 
which G has a subgraph H with prescribed local degrees. In case G has only 
even cycles, so that the nodes of G can be partitioned into two sets S, T such 
that all arcs join nodes of S to those of 7, the subgraph problem can be 
stated as a flow problem in G, and hence Theorem 1 can be applied. We know 
of no way, however, to make use of flow theory in the general case. 


7. Systems of representatives. In our applications of the feasibility 
theorem thus far, the set R of intermediate nodes has been vacuous. We 
conclude with an application, suggested to us by Gale (4), in which this will 
not be the case. 

Let $E_1, \dots, E_n$ be subsets of a given set $E = \{e_1, \dots, e_m\}$. A list 


$D = \{e_{i_1}, \dots, e_{i_n}\}$


of n distinct elements of E, such that $e_{i_j} \in E_j$, is a system of distinct 
representatives for $E_1, \dots, E_n$, in which $e_{i_j}$ represents $E_j$. (A well-known theorem 
of P. Hall gives necessary and sufficient conditions for the existence of a 
system of distinct representatives.) Suppose, in addition, that $P_1, \dots, P_p$ is 
a partition of E, and that it is desired to establish existence conditions for a 
D such that the intersection of D with each $P_k$ has cardinality between 
prescribed bounds. Hoffman and Kuhn (5) have used the duality theorem of 
linear-equality theory, applied to a linear-programming problem of 
transportation type, to prove the following theorem: 













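THEOREM 8 (Hoffman-Kuhn).* Let $\alpha_k$ and $\beta_k$, $k = 1, 2, \dots, p$, satisfying 
$0 \le \alpha_k \le \beta_k$, be integers associated with a partition $P_1, \dots, P_p$ of a given set 
$E = \{e_1, \dots, e_m\}$. The subsets $E_1, \dots, E_n$ of E have a system of distinct 
representatives D satisfying $\alpha_k \le |D \cdot P_k| \le \beta_k$, $k = 1, \dots, p$, if, and only if, 


(27)  $\left| \left( \sum_{k \in U} P_k \right) \cdot \left( \sum_{j \in V} E_j \right) \right| \ge |V| - \sum_{k \in \bar{U}} \beta_k,$

(28)  $\left| \left( \sum_{k \in U} P_k \right) \cdot \left( \sum_{j \in V} E_j \right) \right| \ge |V| - n + \sum_{k \in U} \alpha_k,$


hold for all subsets $U \subseteq \{1, \dots, p\}$ and $V \subseteq \{1, \dots, n\}$. 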


To establish (27) and (28) as necessary and sufficient conditions for the 
existence of the required system of distinct representatives, we set up the 
following feasibility problem. Let 


$S = \{x_1, \dots, x_p\},$
$R = \{y_1, \dots, y_m\},$
$T = \{z_1, \dots, z_n\}$


be the nodes of a network G, and define arcs in G as follows: 


$(x_k, y_i)$ is an arc if, and only if, $e_i \in P_k$, 
$(y_i, z_j)$ is an arc if, and only if, $e_i \in E_j$. 


The capacity function is taken to be 


$c(x_k, y_i) = 1,$
$c(y_i, z_j) = \infty.$


With each $x_k \in S$, associate the bounds $\alpha_k$, $\beta_k$ on the flow leaving $x_k$, and 
similarly require that the flow into $z_j \in T$ be precisely unity ($a(z_j) = b(z_j) = 1$). 

From the definition of the capacity function and the assumption that 
$P_1, \dots, P_p$ is a partition of E, it follows that the amount of flow through 
each node $y_i \in R$ is at most one. Thus an integral feasible flow f from S to T 
picks out a set D fulfilling the hypotheses of the theorem: 


$D = \{e_i \mid f(S, y_i) = f(y_i, T) = 1\}.$


Conversely, given a D satisfying the assumptions of the theorem, we can 
define an integral feasible flow f by 


$f(x_k, y_i) = \begin{cases} 1 & \text{if } e_i \in D \cdot P_k, \\ 0 & \text{otherwise;} \end{cases}$


$f(y_i, z_j) = \begin{cases} 1 & \text{if } e_i \text{ represents } E_j, \\ 0 & \text{otherwise.} \end{cases}$


*It is also stated in (5) that the authors have not been able to prove this result without 
using the duality theorem. However, Gale has recently shown that Theorem 8 is a consequence 
of the circulation theorem. It is therefore not surprising that the result can be deduced from 
our Theorem 1. 










Thus the feasibility problem in G is equivalent to the existence of a D meeting 
the requirements of the theorem, and we may consequently apply Theorem 1. 
Let $X$, $\bar{X}$ be a partition of the nodes of G, and set 


$S \cdot X = U, \quad R \cdot X = W, \quad T \cdot X = V,$
$S \cdot \bar{X} = \bar{U}, \quad R \cdot \bar{X} = \bar{W}, \quad T \cdot \bar{X} = \bar{V}.$

Then (7) and (8) become 
(29)  $\beta(\bar{U}) + c(X, \bar{X}) \ge |\bar{V}|,$
(30)  $|V| + c(X, \bar{X}) \ge \alpha(U),$


respectively. Since $c(y_i, z_j) = \infty$, these conditions hold automatically unless 
$(X, \bar{X})$ contains no arcs from R to T. Thus we may restrict attention to 
partitions $X$, $\bar{X}$ such that $B(\bar{V}) \subseteq \bar{W}$, so that $c(X, \bar{X}) = c(U, \bar{W})$. But since 
the right-hand sides of (29) and (30) are independent of W, it suffices to 
select $\bar{W} = B(\bar{V})$. Then we have 


$c(X, \bar{X}) = c(U, B(\bar{V})) = |A(U) \cdot B(\bar{V})|.$


Consequently, a feasible flow from S to T exists if, and only if, 


(31)  $|A(U) \cdot B(\bar{V})| \ge |\bar{V}| - \beta(\bar{U}),$

(32)  $|A(U) \cdot B(\bar{V})| \ge \alpha(U) - |V|, \quad \text{all } U \subseteq S,\ V \subseteq T.$

Replacing $|V|$ by $n - |\bar{V}|$ in (32) and translating (31) and (32) into 
set-theoretic statements yield (27) and (28), respectively. Thus (27) is a necessary 
and sufficient condition that there be a system of distinct representatives D 
such that $|D \cdot P_k| \le \beta_k$, whereas (28) is a necessary and sufficient condition 
that there be a system of distinct representatives D with $|D \cdot P_k| \ge \alpha_k$. 
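
For very small instances, the Hoffman-Kuhn conditions can be compared directly with an exhaustive search for a system of distinct representatives. The sketch below is such a comparison under stated assumptions: blocks and subsets are indexed from 0, `lo` and `hi` are the lower and upper bounds attached to the blocks of the partition, and the count n appearing in (28) is taken to be the number of subsets $E_j$, as the derivation from (32) requires. The function names are hypothetical.

```python
from itertools import chain, combinations, permutations

def subsets(items):
    items = list(items)
    return chain.from_iterable(combinations(items, r) for r in range(len(items) + 1))

def hoffman_kuhn_holds(blocks, families, lo, hi):
    """Check (27) and (28) for all index sets U and V."""
    p, n = len(blocks), len(families)
    for U in map(set, subsets(range(p))):
        in_U = set().union(*(blocks[k] for k in U))
        for V in map(set, subsets(range(n))):
            in_V = set().union(*(families[j] for j in V))
            common = len(in_U & in_V)
            # (27): upper bounds on the blocks outside U.
            if common < len(V) - sum(hi[k] for k in range(p) if k not in U):
                return False
            # (28): lower bounds on the blocks in U.
            if common < len(V) - n + sum(lo[k] for k in U):
                return False
    return True

def sdr_exists(blocks, families, lo, hi):
    """Exhaustive search for a suitable system of distinct representatives."""
    elements = sorted(set().union(*blocks))
    n = len(families)
    for reps in permutations(elements, n):
        if all(reps[j] in families[j] for j in range(n)):
            D = set(reps)
            if all(lo[k] <= len(D & blocks[k]) <= hi[k] for k in range(len(blocks))):
                return True
    return False

# Hypothetical data: E = {1, 2, 3, 4}, two blocks, two subsets.
blocks = [{1, 2}, {3, 4}]
families = [{1, 2}, {1, 3}]
print(hoffman_kuhn_holds(blocks, families, [1, 1], [1, 1]))
print(hoffman_kuhn_holds(blocks, families, [0, 2], [2, 2]))
```

In the second call both representatives would have to come from the block {3, 4}, which is impossible because the first subset lies entirely in {1, 2}.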


REFERENCES 

1. L. R. Ford, Jr. and D. R. Fulkerson, Maximal flow through a network, Can. J. Math., 
8 (1956), 399-404. 
2. L. R. Ford, Jr. and D. R. Fulkerson, A simple algorithm for finding maximal network flows 
and an application to the Hitchcock problem, Can. J. Math., 9 (1957), 210-18. 
3. L. R. Ford, Jr. and D. R. Fulkerson, Network flow and systems of representatives, 
Can. J. Math., 10 (1958), 78-85. 
4. D. Gale, A theorem on flows in networks, Pac. J. Math., 7 (1957), 1073-82. 
5. A. J. Hoffman and H. W. Kuhn, On systems of distinct representatives, in H. W. Kuhn 
and A. W. Tucker (eds.), Linear inequalities and related systems, Annals of Mathematics 
Study No. 38 (Princeton, 1956). 
6. O. Ore, Studies on directed graphs, I, Ann. Math., 63 (1956), 383-406. 
7. O. Ore, Studies on directed graphs, II, Ann. Math., 64 (1956), 142-53. 
8. O. Ore, Graphs and subgraphs, Trans. Amer. Math. Soc., 84 (1957), 109-37. 
9. H. J. Ryser, Combinatorial properties of matrices of zeros and ones, Can. J. Math., 9 
(1957), 371-7. 
10. W. T. Tutte, The factorization of linear graphs, J. Lond. Math. Soc., 22 (1947), 107-11. 
11. W. T. Tutte, The 1-factors of oriented graphs, Proc. Amer. Math. Soc., 4 (1953), 922-30. 
12. W. T. Tutte, The factors of graphs, Can. J. Math., 4 (1952), 314-29. 








The Rand Corporation 











THE REGULAR MAPS 
ON A SURFACE OF GENUS THREE 


F. A. SHERK 


Introduction. A considerable volume of research on the theory of regular 
maps is now in existence. Systematic enumerations of regular maps on the 
surfaces of genus 1 and 2 were begun by Brahana (1; 2) and completed by 
Coxeter (6; 7, p. 141). In addition Coxeter enumerated the regular maps on 
the simplest non-orientable surfaces (7, pp. 116, 139), and constructed tables 
of some interesting families of regular maps (3; 7, p. 140). 

Most of the regular maps on a surface of genus 3 have appeared in these 
papers, but no systematic enumeration of them seems to have been attempted. 
The ultimate goal of this paper is a complete list of these regular maps. However, 
the families of maps $\{j \cdot p, q\}$ and $\{j \cdot p, j \cdot q\}$ which are defined in § 4 and 
listed in Tables I and II are of considerable interest in themselves. Also of 
some importance is the complete list of regular maps of type {p, 3} with six 
or fewer faces (§5 and Table III). 

A method of deriving regular maps by identification of faces in a regular 
tessellation is introduced in § 2 and used in §§ 5 and 7. Although cumbersome 
in some cases, it is the only reliable tool which has yet been developed for 
completing a list of regular maps of genus p > 1 (Brahana’s method (2, pp. 
281-4) is dependent upon the completeness and accuracy of permutation 
group tables). 


1. Elementary concepts and results. A map is a partitioning of an 
unbounded surface into $N_2$ simply-connected, non-overlapping regions called 
faces by means of $N_1$ lines called edges. The $N_0$ intersections of the edges are 
called vertices. 

The Euler-Poincaré characteristic 


1.1  $\chi = N_0 - N_1 + N_2$
has the same value for every map drawn on this surface. If the surface is 
orientable, then 
1.2  $\chi = 2 - 2p,$


where p is the genus of the surface. 


Received July 25, 1958. The bulk of these results form a part of the author’s Ph.D. thesis, 
which was written under the supervision of Professor H. S. M. Coxeter. The author wishes to 
express his sincere thanks to Professor Coxeter for his kind and generous help. 

The research was carried on while the author held a research studentship from the National 
Research Council of Canada. 




To every map there corresponds a dual map having $N_0$ faces, one surrounding 
each vertex of the original map, $N_1$ edges, one crossing each edge 
of the original map, and $N_2$ vertices, one contained in the interior of each 
face of the original map (5, p. 6). 

With any map there is associated a group of transformations which leave 
the map invariant and preserve incidences, that is, a group of automorphisms 
(7, p. 100). An automorphism is determined by its effect on any one face. 
Suppose that the group contains, in particular, two automorphisms R and S, 
the first of which cyclically permutes the edges bounding a face F, while 
the other cyclically permutes the edges which meet at a vertex V of F. A 
map containing these two automorphisms is said to be regular. 

It is immediately evident that if the face F is p-sided, and if q edges meet 
at the vertex V, then every face of the regular map is p-sided and exactly q 
edges meet at every vertex. Thus the regular map is composed of p-gons, q 
meeting at each vertex. Such a map is said to be a "map of type {p, q}," in 
analogy with Schläfli's notation for a regular polyhedron (5, p. 14). The dual 
map is of type {q, p} and is, of course, also regular. It also follows from the 
definition of a regular map that the group of the map is transitive on its 
vertices, edges, and faces. 

Suppose that we divide the surface of the regular map of type {p, q} into 
$pN_2$ triangles by adding to the map the lines which join the vertices of each 
face to the corresponding vertex of the dual map (cf. Figure 1 for the case 
of a map of type {6, 3}). Thus each face of the map is made up of p triangles, 











FIGURE 1 













each edge borders on 2 triangles, and each vertex is surrounded by 2q triangles. 
It follows that 


1.3  $pN_2 = 2N_1 = qN_0.$
Accordingly if the map has $N_1$ edges it has $2N_1/p$ faces and $2N_1/q$ vertices. 
Substituting in formula 1.1, we have for the surface of the regular map: 


1.4  $\chi = 2N_1\left(\frac{1}{p} + \frac{1}{q} - \frac{1}{2}\right).$


We define the group of a regular map to be the group which is generated 
by the automorphisms R and S. Examining Figure 1, we note that the auto- 
morphism* RS interchanges the triangles OAB and PAB. Thus RS is of 
period 2. It is easy to see that this result is true for any regular map; the 
group of a regular map of type {p, q} must satisfy the relations 


1.5  $R^p = S^q = (RS)^2 = E,$


where E denotes the identity element. These relations are sufficient to define 
the group if the surface is simply-connected, but in any other case at least 
one extra relation is needed. 

Looking again at Figure 1, we note that the edge AB is carried into itself 
by two automorphisms in the group, namely E and RS. When the surface 
on which the map lies is non-orientable, the group contains two other auto- 
morphisms which carry AB into itself. One of these will leave A and B in- 
variant, interchanging O and P, while the other leaves O and P invariant and 
interchanges A and B. These automorphisms are called reflections since they 
operate in a manner analogous to the reflections of the Euclidean plane 
(5, p. 75). Since the group is transitive on the edges of the map it must be 
of order $4N_1$. 

Any regular map whose automorphisms include reflections is said to be 
reflexible (7, p. 101). Certain non-reflexible regular maps do exist. Coxeter 
(6, p. 26; 7, pp. 103, 107) exhibited the non-reflexible regular maps on a 
surface of genus 1 and stated that no others were known (7, p. 102). However, 
Frucht (9) discovered a non-reflexible regular map on a surface of genus 55 
which is the embedding in that surface of a one-regular graph of degree 
three. Any non-reflexible regular map must lie on an orientable surface, since 
the group of a regular map on a non-orientable surface must contain reflections 
(7, p. 101). 

If the map is on a non-orientable surface, or if it is non-reflexible, the 
group of the map is the complete group of automorphisms. Every map which 
is reflexible and lies on an orientable surface has a larger group of automor- 
phisms which we shall call the extended group of the map (4, p. 125). The 
extended group includes reflections and is therefore of order $4N_1$. It contains 
“the group of the map” as a subgroup of index 2. 


*By RS, the product of R and S, we mean the automorphism which is achieved by per- 
forming R first and then performing S. 







The automorphisms that comprise the group of an orientable regular map 
are called rotations. By 1.3 the order of the group may be expressed in the 
forms $pN_2$ or $qN_0$ as well as in the form $2N_1$. 

In virtue of relations 1.4 and 1.2, any regular map of type {p, q} which 
has $N_1$ edges and is on an orientable surface is on a surface of genus 


1.6  $p = 1 - N_1\left(\frac{1}{p} + \frac{1}{q} - \frac{1}{2}\right)$

(on the left p denotes the genus; on the right p and q are the parameters of the type). 

The expression "regular map on a surface of genus p" will now be shortened 
to "regular map of genus p." 
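
The counting relations of this section can be turned into a short calculation. The sketch below, under stated assumptions, derives the numbers of vertices and faces, the characteristic, and the genus of an orientable regular map from its type and its number of edges. The function names are hypothetical, and the two sample maps (the cube, and Klein's map of 24 heptagons with 84 edges) are standard examples supplied here for illustration rather than taken from the text above.

```python
from fractions import Fraction

def map_counts(p, q, n_edges):
    """Vertices, edges, faces and characteristic of a map of type {p, q}."""
    n_faces = Fraction(2 * n_edges, p)       # from p * N2 = 2 * N1
    n_vertices = Fraction(2 * n_edges, q)    # from q * N0 = 2 * N1
    chi = n_vertices - n_edges + n_faces     # Euler-Poincare characteristic
    return n_vertices, n_edges, n_faces, chi

def genus(p, q, n_edges):
    """Genus of an orientable map, using chi = 2 - 2 * genus."""
    chi = map_counts(p, q, n_edges)[3]
    return (2 - chi) / 2

print(map_counts(4, 3, 12), genus(4, 3, 12))    # the cube
print(map_counts(7, 3, 84), genus(7, 3, 84))    # Klein's map of 24 heptagons
```

Exact rational arithmetic is used so that a non-integral vertex or face count, which signals an impossible combination of p, q, and the number of edges, is visible instead of being rounded away.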

The regular maps of genus zero are simply the projections on concentric 
spheres of the 5 convex regular polyhedra, {3, 3}, {4, 3}, {3, 4}, {5, 3}, and 
{3, 5}, together with the "dihedral" maps {p, 2} (p > 1) and their duals 
{2, p}. The groups of the regular maps of genus zero are the well-known 
polyhedral rotation groups (11, pp. 10-20; 5, pp. 45-7), denoted by the 
symbols $[p, q]^+$ (7, p. 38), whose abstract definitions are given by 1.5 with 
appropriate values for p and q. Thus in the case of the regular maps of genus 
zero, the relations 1.5 are sufficient, as well as necessary, to define the group. 


2. The regular tessellations. The above description of a regular map 
can be extended to include regular maps on an infinite surface. Thus, for 
example, we have in the Euclidean plane the regular maps {4, 4}, {6,3}, and 
{3, 6}, more commonly called regular tessellations (5, pp. 58, 59). There are 
also regular tessellations in the hyperbolic plane (7, p. 53); they are of type 
{p, q} for all p and q such that $(p - 2)(q - 2) > 4$. The regular tessellations 
on the sphere are just the regular maps of genus zero. All regular tessellations 
are simply-connected maps. 

As in the case of the regular maps on a sphere, the relations 1.5 are sufficient 
to define the group of a regular tessellation. It follows that the group of a 
regular map of type {p, q} on an orientable surface is a factor group of the 
group of the regular tessellation {p, q}. It is also true that the plane of the 
tessellation is a universal covering surface for the surface in question (7, 
pp. 25, 26). These facts suggest a method of discovering regular maps. Beginning 
with a regular tessellation {p, q} we add further relations to those of 
1.5 by abstractly identifying certain faces of the tessellation (the exact 
procedure in this step will be outlined in the proof of Theorem 3). If the 
added relations do not affect the periods of R, S, and RS, and if they are 
sufficient to make the resulting group finite, let us say of order g, a regular 
map of type {p, q} has been discovered. It has g/q vertices, g/2 edges, and 
g/p faces. It lies on a surface of genus 


$1 - \frac{g}{2}\left(\frac{1}{p} + \frac{1}{q} - \frac{1}{2}\right)$


(cf. 1.6). 
















Furthermore, the above method will establish the existence or non-existence 
of all regular maps of type {p, q} with given group order. 


3. Some general lemmas. The following lemmas form the essential 
groundwork for all our results. 


LEMMA 1. For any map of type {p, q} on a surface of Euler-Poincaré 
characteristic $\chi < 1$, $\min(p, q) \ge 3$. 


Proof. Consider a map of type {p, q} which has k faces. It follows from 
1.3 and 1.1 that 

$\chi = \frac{pk}{q} - \frac{pk}{2} + k.$


Rearranging this equation, we obtain 
$k - \chi = pk(q - 2)/2q.$


If $q \le 2$, then $k - \chi \le 0$ and $\chi \ge k \ge 1$. Thus if $\chi < 1$, $q \ge 3$. A similar 
argument holds for p when one considers the dual map, of type {q, p}. 


Lemma 2. If two edges belonging to the same face of a regular map are identified, 
the map has only one face. 


Proof. We noted earlier that the group of a regular map is transitive on 
the edges of that map. Thus if two edges of a face are identified, then all 
the other edges of that face are also identified in pairs; the result is a one- 
faced map. 


LEMMA 3. If exactly two distinct faces come together at a vertex of a regular 
map of type {p, q}, the map is 2-faced, q is even, and the faces alternate around 
the vertex. 


Proof. If a face is contiguous to itself around a vertex, then by Lemma 2 
the map is one-faced, contrary to our hypothesis. Thus q is even and the 
faces, $\alpha$ and $\beta$ say, which surround a vertex alternate around that vertex 
(cf. Figure 2, where $\alpha$ and $\beta$ surround the vertex V). Now consider any edge 
VV' (Figure 2). This edge borders on $\alpha$ and $\beta$, and hence $\alpha$ and $\beta$ alternate 
around V' as well as around V. This happens at every vertex since the group 
of the map is transitive on its edges. Therefore $\alpha$ and $\beta$ are the only faces. 


LEMMA 4. A one-faced map of type {p, q} is regular if, and only if, one of the 
following two conditions is satisfied: 
(i) $\tfrac{1}{2}p$ is an even integer and $q = p$; 
(ii) $\tfrac{1}{2}p$ is an odd integer and $q = \tfrac{1}{2}p$. 


Proof. The single face of a one-faced map must have an even number of 
edges since these edges are identified in pairs to form the edges of the map. 
Thus p = 2n, where n is some integer, and the group of the one-faced regular 
map {2n, q} is the cyclic group of order 2n generated by the rotation R of 










FIGURE 2 


§ 1. Now the group of any regular map may be expressed in terms of the 
generators R and T = RS instead of the R and S used earlier (2, p.269). The 
three relations of 1.5 are then equivalent to 

3.1 R? = T? = (RT)* = E. 


In the present case, T must be expressible in terms of R, and since 7? = E, 
T = R". The existence of a regular map of type {2n, q} depends upon the 
period of RT, which must be g. But RT = R**'. Hence if the map is regular 


(RT)* = RUD) = E = R™, 


and therefore 2n|q(n + 1). Now (n,n + 1) = 1, so that if m is even, 2n | q, 
while if ” is odd, m | g. Since the map has only m edges, g < 2n. Thus if m is 
even, g = 2n, while if m is odd, 

(RT)* = RO+D*® = (R*)te+n = EF, 


and thus g|. But m|q, therefore gq = n. Conversely, any one-faced map 
of type {4p, 4p} or {4p + 2,2p + 1} (p = 0, 1,2,...) is regular. 

It is easily seen from 1.6 that the above two one-faced regular maps lie 
on a surface of genus p. 
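
The arithmetic in the proof of Lemma 4 can be checked mechanically. The Python sketch below is an added illustration, not part of the original argument; the function name `one_faced_regular_type` is hypothetical. It assumes, as in the proof, that the group is cyclic of order 2n with $T = R^n$, so that q is the period of $R^{n+1}$, and it assumes the usual orientable relation $\chi = N_0 - N_1 + N_2 = 2 - 2g$ to obtain the genus.

```python
from math import gcd

def one_faced_regular_type(n):
    """Return (p, q, genus) for the one-faced regular map whose face is a 2n-gon.

    Assumption (from the proof of Lemma 4): the group is cyclic of order 2n,
    generated by R, with T = R^n and hence RT = R^(n+1).
    """
    p = 2 * n
    # Period of R^(n+1) in a cyclic group of order 2n.
    q = p // gcd(p, n + 1)
    vertices = p // q   # N0: the n edges have 2n ends, q at each vertex
    edges = n           # N1: the 2n sides of the face are identified in pairs
    faces = 1           # N2
    chi = vertices - edges + faces
    genus = (2 - chi) // 2   # assumes an orientable surface
    return p, q, genus

if __name__ == "__main__":
    for n in range(2, 9):
        print(n, one_faced_regular_type(n))
```

For n = 6 and n = 7 the sketch returns the types {12, 12} and {14, 7}, both of genus 3, in agreement with conditions (i) and (ii).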


LEMMA 5. If the rotation $R^h$ (1 < h < p, where p is the period of R) carries a
vertex, edge, or face of a regular map into itself, while any rotation $R^i$ (0 < i < h)
does not do so, then $p \equiv 0 \pmod{h}$.


Proof. The integer p may be put into the form

$p = mh + n$

where m and n are integers and $0 \leq n < h$. Since both $R^h$ and $R^p$ (= E)
carry the vertex, edge, or face into itself, so also must $R^n$. Therefore n = 0.













LEMMA 6. The abstract definition*

3.2    $R^{jp} = S^q = (RS)^2 = E, \quad R^p \rightleftarrows S \qquad (0 \leq (p - 2)(q - 2) < 4)$

is significant only if $j \mid Q$, where $Q = 4q/[4 - (p - 2)(q - 2)]$, and then defines
a group of order jpQ.

Proof. If we exclude the first relation in 3.2 we have

3.3    $S^q = (RS)^2 = E, \quad R^p \rightleftarrows S.$

If SR = T, this becomes

$T^2 = S^q = E, \quad (TS)^p = (ST)^p.$

These relations define the group ((2, q | p)) of order $pQ^2$, which was introduced
by Coxeter and Moser (7, p. 79). The period of R ($= TS^{-1}$) is pQ
(7, p. 71) and therefore the abstract definition 3.2 is significant only if this
period is a multiple of jp. If we add to 3.3 the relation $R^{jp} = E$, where $j \mid Q$,
the only effect is to change the period of R to jp; the periods of S and RS will
remain unchanged. Now it is easily shown that the number of cosets of {R}
in 3.2 remains the same, no matter what the particular choice of j is. When
j = 1, the group is $[p, q]^+$, the group of the regular map {p, q}, and {R} has
Q cosets (7, p. 38). Thus the group defined by 3.2 has order jpQ.

To the group defined by 3.2 or

$(TS)^p = (ST)^p = Z, \quad T^2 = S^q = Z^j = E,$

we assign the symbol ((2, q | p; j)). In particular, ((2, q | p; 1)) = $[p, q]^+$.

4. Two new families of regular maps. Coxeter and Moser (7, § 8.8)
introduced the regular map {p + p, q} ($0 \leq (p - 2)(q - 2) < 4$) and its dual
{q, p + p}, whose group has the abstract definition

$R^{2p} = T^2 = (RT)^q = (R^pT)^2 = E,$

or, in terms of R and S,

$R^{2p} = S^q = (RS)^2 = E, \quad R^p \rightleftarrows S.$

We generalize this notion by considering the regular map of type {jp, q}
($0 \leq (p - 2)(q - 2) < 4$) and its dual, of type {q, jp}, whose group G has
the following property: the centre of G is a cyclic group, generated by $R^p$,
where R has its usual meaning as a generator of the group. Such a group G
will satisfy the following four relations:

4.1    $R^{jp} = S^q = (RS)^2 = E, \quad R^p \rightleftarrows S.$

By Lemma 6, these relations are significant only if $j \mid Q$, where
$Q = 4q/[4 - (p - 2)(q - 2)]$, and then they define the group ((2, q | p; j)) of order jpQ.


*The notation $A \rightleftarrows B$ means that A and B commute.













Now the central quotient group of G is the group $[p, q]^+$ of order pQ. Thus G
is of order jpQ, which is precisely the order of the group ((2, q | p; j)).
Therefore the relations 4.1 define G.

To the above map of type {jp, q} we assign the symbol {j·p, q}, and denote
its dual by {q, j·p}. The map occurs for all integer values $p \geq 2$, $q \geq 2$ and
$j > 0$ satisfying the two conditions $0 \leq (p - 2)(q - 2) < 4$ and $j \mid Q$. Its
group is ((2, q | p; j)).
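
The existence conditions just stated lend themselves to a short enumeration. The following Python sketch is an added illustration (the function name `family_jp_q` is hypothetical). It assumes the counts $N_0 = jpQ/q$, $N_1 = jpQ/2$, $N_2 = Q$ that follow from a group of order jpQ acting on a map of type {jp, q}, and the orientable relation $\chi = 2 - 2g$.

```python
def family_jp_q(p, q):
    """List (j, N0, N1, N2, genus, order) for the maps {j.p, q} with j | Q.

    Assumptions: 0 <= (p-2)(q-2) < 4, Q = 4q / [4 - (p-2)(q-2)] as in Lemma 6,
    and a group of order j*p*Q.
    """
    d = 4 - (p - 2) * (q - 2)
    if p < 2 or q < 2 or d <= 0:
        raise ValueError("need p, q >= 2 and (p - 2)(q - 2) < 4")
    Q, remainder = divmod(4 * q, d)
    assert remainder == 0  # Q is the number of faces of the spherical map {p, q}
    rows = []
    for j in range(1, Q + 1):
        if Q % j:
            continue  # Lemma 6: the relations are significant only if j | Q
        order = j * p * Q
        n0 = order // q   # vertices
        n1 = order // 2   # edges
        n2 = Q            # faces, order / (j * p)
        genus = (2 - (n0 - n1 + n2)) // 2
        rows.append((j, n0, n1, n2, genus, order))
    return rows

if __name__ == "__main__":
    for row in family_jp_q(3, 3):
        print(row)
```

With p = q = 3 the sketch yields j = 1, 2, 4, that is, the tetrahedron {3, 3} together with {2·3, 3} of genus 1 and {4·3, 3} of genus 3.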

In particular, the maps {2·p, q} and {q, 2·p} are the {p + p, q} and
{q, p + p} respectively of Coxeter and Moser (7, § 8.8) who pointed out
that these maps may be drawn on a two-sheeted Riemann surface of the
proper genus in a remarkably symmetrical manner. This construction is
capable of generalization to the case of the regular maps {j·p, q} and {q, j·p}
($j \neq 2$).

In the proof of Lemma 6 it was shown that when j = Q, the groups 3.2
are the groups ((2, q | p)) of order $pQ^2$. They were shown by Coxeter and
Moser (7, pp. 79-80) to be the groups of the regular complex polygons 2{2p}q,
discovered by Shephard (12, p. 92). When these complex polygons are compared
with the corresponding regular maps {Q·p, q}, it can be shown by
the proper interpretation of the group generators in each case that the vertices
and edges of the polygon form the same graph as the vertices and edges of
the map. Thus the map may be regarded as a real representation of the
complex polygon.

Generalizing in another direction, we consider the regular map of type
{jp, jq} ($0 \leq (p - 2)(q - 2) < 4$) and its dual, of type {jq, jp}, whose
group G′ has the following properties: the centre of G′ is a cyclic group
generated by $R^p$; and $R^p = S^q$, where R and S have their usual meanings as
generators of G′. Such a group will have among its defining relations the
following:

4.2    $R^p = S^q = Z, \quad (RS)^2 = Z^j = E.$

These relations are significant only if

$j \mid \frac{p + q}{q}\, Q,$

where $Q = 4q/[4 - (p - 2)(q - 2)]$, and then they define the group
(p, q | 2; j) of order jpQ (7, pp. 71-3). This is a factor group of Miller's group
(p, q | 2) which is defined by the relations

$R^p = S^q, \quad (RS)^2 = E.$

Now the centre $\{R^p\}$ of G′ is of order j, and the central quotient group of G′
is $[p, q]^+$, of order pQ. Thus G′ is of order jpQ, which is precisely the order
of the group (p, q | 2; j). Therefore G′ is defined by 4.2.

To the above type of map whose group is (p, q | 2; j) we assign the symbol
{j·p, j·q}, and denote its dual by {j·q, j·p}. The map occurs for all integer
values $p \geq 2$, $q \geq 2$ and $j > 0$ satisfying the two conditions
$0 \leq (p - 2)(q - 2) < 4$ and

$j \mid \frac{p + q}{q}\, Q.$

The members {(r + 1)·(r − 1), (r + 1)·2} of this family were noted by
Coxeter and Moser (7, p. 114).

The regular maps {p, q} of genus zero are members of the family {j·p, q}
as well as of the family {j·p, j·q}. With these exceptions, all the regular maps
{j·p, q} and {j·p, j·q} ($p \geq q$) are listed in Tables I and II respectively. The
sixth column in both tables exhibits some interesting isomorphisms between
the group of the map and certain well-known groups. The information for
the sixth column of Table I was kindly supplied by W. O. J. Moser; in the
case of Table II the source is 7, § 6.6.


TABLE I
THE REGULAR MAPS {j·p, q} ($j \geq 2$)

Map                  $N_0$   $N_1$   $N_2$   Genus                        Group                                                         Order
{j·2, q} (j | q)     2j      jq      q       $\tfrac{1}{2}(j-1)(q-2)$     ((2, q | 2; j))                                               2jq
{q·2, q}             2q      $q^2$   q       $\tfrac{1}{2}(q-1)(q-2)$     ((2, q | 2))                                                  $2q^2$
{2·p, 2}             2p      2p      2       0                            ((2, 2 | p)) ≅ $\mathfrak{D}_{2p}$                            4p
{2·3, 3}             8       12      4       1                            ((2, 3 | 3; 2)) ≅ $\mathfrak{A}_4 \times \mathfrak{C}_2$      24
{4·3, 3}             16      24      4       3                            ((2, 3 | 3))                                                  48
{2·4, 3}             16      24      6       2                            ((2, 3 | 4; 2))                                               48
{3·4, 3}             24      36      6       4                            ((2, 3 | 4; 3))                                               72
{6·4, 3}             48      72      6       10                           ((2, 3 | 4))                                                  144
{2·5, 3}             40      60      12      5                            ((2, 3 | 5; 2)) ≅ $\mathfrak{A}_5 \times \mathfrak{C}_2$      120
{3·5, 3}             60      90      12      10                           ((2, 3 | 5; 3)) ≅ $\mathfrak{A}_5 \times \mathfrak{C}_3$      180
{4·5, 3}             80      120     12      15                           ((2, 3 | 5; 4))                                               240
{6·5, 3}             120     180     12      25                           ((2, 3 | 5; 6)) ≅ $\mathfrak{A}_5 \times \mathfrak{C}_6$      360
{12·5, 3}            240     360     12      55                           ((2, 3 | 5))                                                  720
{2·3, 4}             12      24      8       3                            ((2, 4 | 3; 2)) ≅ $\mathfrak{S}_4 \times \mathfrak{C}_2$      48
{4·3, 4}             24      48      8       9                            ((2, 4 | 3; 4)) ≅ $\mathfrak{S}_4 \times \mathfrak{C}_4$      96
{8·3, 4}             48      96      8       21                           ((2, 4 | 3))                                                  192
{2·3, 5}             24      60      20      9                            ((2, 5 | 3; 2)) ≅ $\mathfrak{A}_5 \times \mathfrak{C}_2$      120
{4·3, 5}             48      120     20      27                           ((2, 5 | 3; 4))                                               240
{5·3, 5}             60      150     20      36                           ((2, 5 | 3; 5)) ≅ $\mathfrak{A}_5 \times \mathfrak{C}_5$      300
{10·3, 5}            120     300     20      81                           ((2, 5 | 3; 10)) ≅ $\mathfrak{A}_5 \times \mathfrak{C}_{10}$  600
{20·3, 5}            240     600     20      171                          ((2, 5 | 3))                                                  1200





5. Regular maps of type {p, 3}. In some respects the most interesting
regular maps are those which have 3 faces at a vertex. We shall now proceed
to enumerate these maps when the number of faces is small.

To facilitate reference to it, a map of type {p, q} having k faces will be
denoted by the symbol $^k\{p, q\}$. In particular we shall now study the regular
maps $^k\{p, 3\}$ for small values of k.
















TABLE II
THE REGULAR MAPS {j·p, j·q} ($p \geq q$; $j \geq 2$)

Map                          $N_0$   $N_1$     $N_2$   Genus                   Group                                                       Order
{j·p, j·2} (j | p + 2)       p       jp        2       $\tfrac{1}{2}(j-1)p$    (p, 2 | 2; j)                                               2jp
{(p+2)·p, (p+2)·2}           p       p(p+2)    2       $\tfrac{1}{2}p(p+1)$    (p, 2 | 2)                                                  2p(p+2)

{2·3, 2·3}                   4       12        4       3                       (3, 3 | 2; 2) ≅ $\mathfrak{A}_4 \times \mathfrak{C}_2$      24
{4·3, 4·3}                   4       24        4       9                       (3, 3 | 2; 4) ≅ $\mathfrak{A}_4 \times \mathfrak{C}_4$      48
{8·3, 8·3}                   4       48        4       21                      (3, 3 | 2)                                                  96
{2·4, 2·3}                   8       24        6       6                       (4, 3 | 2; 2)                                               48
{7·4, 7·3}                   8       84        6       36                      (4, 3 | 2; 7)                                               168
{14·4, 14·3}                 8       168       6       78                      (4, 3 | 2)                                                  336
{2·5, 2·3}                   20      60        12      15                      (5, 3 | 2; 2) ≅ $\mathfrak{A}_5 \times \mathfrak{C}_2$      120
{4·5, 4·3}                   20      120       12      45                      (5, 3 | 2; 4) ≅ $\mathfrak{A}_5 \times \mathfrak{C}_4$      240
{8·5, 8·3}                   20      240       12      105                     (5, 3 | 2; 8) ≅ $\mathfrak{A}_5 \times \mathfrak{C}_8$      480
{16·5, 16·3}                 20      480       12      225                     (5, 3 | 2; 16) ≅ $\mathfrak{A}_5 \times \mathfrak{C}_{16}$  960
{32·5, 32·3}                 20      960       12      465                     (5, 3 | 2)                                                  1920





From Lemma 4 we deduce


THEOREM 1. The only regular map $^1\{p, 3\}$ is the map $\{6, 3\}_{1,0}$ of genus 1
(6, p. 25).


Lemmas 2 and 3 imply


THEOREM 2. There is no regular map $^2\{p, 3\}$.


FIGURE 3


Turning now to the case k = 3, we exhibit in Figure 3 a part of the regular
tessellation {p, 3}. In virtue of Lemmas 2 and 3 the three faces of a regular
map $^3\{p, 3\}$ are situated in the manner of the faces α, β, and γ. Since the
map is 3-faced, δ must be identified with β. Thus, representing faces by right
cosets* of {R}, where the automorphisms R and S act in the indicated manner,
we have

$\{R\}SR^2 = \{R\}S.$

In particular, there is an integer l such that

$SR^2 = R^lS.$

Thus the generators of the group of $^3\{p, 3\}$ must satisfy this relation as well
as

$R^p = S^3 = (RS)^2 = E.$

It follows that

$R^l = SR^2S^{-1} = S^{-2}R^2S^2 = S^{-1}RSR^2S^{-1}R^{-1}S$
$\quad = S^{-1}RR^lR^{-1}S = S^{-1}R^lS = R^2,$

and the extra relation reduces to

$R^2 \rightleftarrows S.$
Moreover, Lemma 5 shows that p is even. Thus the abstract definition

5.1    $R^p = S^3 = (RS)^2 = E, \quad R^2 \rightleftarrows S$

is a special case of 3.2, and Lemma 6 shows that p = 2 or 6. Thus we have


THEOREM 3. There are exactly two regular maps $^3\{p, 3\}$, namely {2, 3} of
genus zero and {3·2, 3} of genus 1.


In the notation of Coxeter (6, p. 25), {3·2, 3} is the map $\{6, 3\}_{1,1}$.

Lemma 6 also shows that the identification of faces carried out in the
above case can yield only a 3-faced regular map (of type {p, 3}). Thus the
four faces of any map $^4\{p, 3\}$ are situated in the manner of α, β, γ, and δ
in Figure 3, and ε must be identical with β. By similar reasoning to that
used in proving Theorem 3, we now have


THEOREM 4. There are exactly three regular maps $^4\{p, 3\}$, namely {3, 3} of
genus zero, {2·3, 3} of genus 1, and {4·3, 3} of genus 3.


In the notation of Coxeter (6, p. 25; cf. 7, p. 116), {2·3, 3} is the map
$\{6, 3\}_{2,0}$.


Turning now to the case $^5\{p, 3\}$, we prove

THEOREM 5. There is no regular map $^5\{p, 3\}$.


*Since the rotation R carries α into itself, α may be represented in the group by the
subgroup {R} while the other faces are represented by right cosets of {R} (2, p. 270). Thus there
is a (1, 1) correspondence between the faces of a regular map and the right cosets of {R} in
its group.

The reader is requested to insert the letter β in the face to the right of α (Figure 3).







Proof. Suppose that a regular map $^5\{p, 3\}$ exists. Then in view of the
results of the two previous theorems, its faces must be situated in the manner
of α, β, γ, δ, and ε in Figure 3, and ζ must be identical with β. The group of
the map must therefore satisfy the relations

5.2    $R^p = S^3 = (RS)^2 = E, \quad R^4 \rightleftarrows S,$

where $p \equiv 0 \pmod 4$. But by Lemma 6, 5.2 defines a group of order 6p,
while a regular map $^5\{p, 3\}$ must have a group of order 5p. The group defined
by 5.2 cannot have a factor group of order 5p; hence there is no regular
map $^5\{p, 3\}$.

It was noted in the above proof that 5.2 defines a group of order 6p. Hence
the identification of ζ with β in Figure 3 yields regular maps $^6\{p, 3\}$. We ask
if any other identification of faces in the tessellation {p, 3} will yield 6-faced
regular maps. The only other possible arrangement is to let α, β, γ, δ, ε, and
ζ be the 6 faces and identify η with β. This gives rise to a group satisfying
the relations

5.3    $R^p = S^3 = (RS)^2 = E, \quad R^5 \rightleftarrows S$

where $p \equiv 0 \pmod 5$. By Lemma 6 this defines a group of order 12p, while
a regular map $^6\{p, 3\}$ must have a group of order 6p. Thus relations 5.3 are
insufficient to define the group which we seek, and we must add a further
relation. Since the regular map we seek is 6-faced, the face θ of Figure 3
must be identified with α, β, γ, δ, ε, or ζ. Each face is surrounded by 5 different
faces, and therefore θ can only be identified with β; in symbols

$\{R\}SR^{-1}SR^2 = \{R\}S.$

In particular, there is an integer l such that $SR^{-1}SR^2 = R^lS$. However, if we
add this relation to those of 5.3 and enumerate right cosets of {R} by the
Todd-Coxeter method (7, p. 12), a collapse occurs in the tables which reduces
the number of cosets of {R} to one. Thus we eliminate the possibility of a
group of order 6p.

Since the Todd-Coxeter method will be employed many times in similar 
situations, it is perhaps advisable to exhibit the tables in this case. They are 


RRRRR...R SSS RSRS RS SR 

PEECGESA § 9 1231 11231 Lia 122 

234562 62 4654 34623 22% 233 
45454 33 1 311 
56565 

SR"'S R* R'S 

12 652 112 

23 235 265 


313124 354 


The table for the fifth relation indicates that $R^l$ carries coset 2 into coset 6
and at the same time carries coset 3 into coset 5. Transferring this information
to the table for the first relation, we see that cosets 2, 3, 4, 5, and 6 are
identical. But the table for the second relation then indicates that coset 2
= coset 1 and the collapse is complete. It is important to notice that the
enumeration of cosets is carried out without knowing the specific values of
p and l. This time-saving fact should be kept in mind and applied to any
particular case when an enumeration of cosets is desired in the following
pages.

Collecting the above results, and taking Lemma 6 into account, we have


THEOREM 6. The only regular maps $^6\{p, 3\}$ are {4, 3} of genus zero, {2·4, 3}
of genus 2, {3·4, 3} of genus 4, and {6·4, 3} of genus 10.


It is not difficult to classify completely the regular maps $^k\{p, 3\}$ for other
small values of k by using the above methods. For example, it can be shown
quite easily that there is only one regular map $^7\{p, 3\}$, namely (6, p. 25) the
map $\{6, 3\}_{2,1}$ of genus 1. For the present, however, we shall confine ourselves
to the following result, obtained by an examination of the proofs of Theorems
3-6.


THEOREM 7. The faces of any regular map $^k\{p, 3\}$ (k > 6) are surrounded
by at least five other distinct faces.


The regular maps $^k\{p, 3\}$ ($k \leq 6$) are listed in Table III.


TABLE III
THE REGULAR MAPS OF TYPE {p, 3} WITH SIX OR FEWER FACES

Symbol              $N_0$   $N_1$   $N_2$   Genus   Group
$\{6, 3\}_{1,0}$    2       3       1       1       $\mathfrak{C}_6$
{2, 3}              2       3       3       0       $[2, 3]^+$ ≅ $\mathfrak{D}_3$
{3·2, 3}            6       9       3       1       ((2, 3 | 2))
{3, 3}              4       6       4       0       $[3, 3]^+$ ≅ $\mathfrak{A}_4$
{2·3, 3}            8       12      4       1       ((2, 3 | 3; 2)) ≅ $\mathfrak{A}_4 \times \mathfrak{C}_2$
{4·3, 3}            16      24      4       3       ((2, 3 | 3))
{4, 3}              8       12      6       0       $[4, 3]^+$ ≅ $\mathfrak{S}_4$
{2·4, 3}            16      24      6       2       ((2, 3 | 4; 2))
{3·4, 3}            24      36      6       4       ((2, 3 | 4; 3))
{6·4, 3}            48      72      6       10      ((2, 3 | 4))


6. The arithmetically possible maps of genus 3. The first step in
determining the regular maps of genus 3 is to list all the maps of type {p, q}
whose vertices, edges, and faces satisfy 1.1 with $\chi = -4$. We call them the
arithmetically possible maps of genus 3.

To facilitate the enumeration of these maps we prove the following theorem, 
due to Coxeter: 






















THEOREM 8. For any map of type {p, q} on a surface of characteristic $\chi < 0$,
if $p \geq q$, then $q \leq 2(2 - \chi)$.


Proof. In terms of p, q, and k (the number of faces of {p, q}), formula 1.1 is

$\frac{kp}{q} - \frac{kp}{2} + k = \chi,$

that is,

6.1    $2kp - kpq + 2kq - 2\chi q = 0.$

Now $p \geq q$, $k \geq 1$, and $\chi < 0$; hence

$(1 - \chi)p - q \geq -\chi q,$
$k[(1 - \chi)p - q] \geq -\chi q,$
$(1 - \chi)kp \geq kq - \chi q,$
$2(1 - \chi)kp \geq 2kq - 2\chi q.$

But by 6.1, $2kq - 2\chi q = kpq - 2kp$. Therefore

$2(1 - \chi)kp \geq kpq - 2kp,$
$[2(2 - \chi) - q]kp \geq 0,$
$q \leq 2(2 - \chi).$

In particular, when the map of type {p, q} lies on a surface of genus 3,
$\chi = -4$, and $q \leq 12$. Now 6.1 with $\chi = -4$ may be written in the form

$kp(q - 2) = 8q + 2kq,$

or

6.2    $p = (8q/k + 2q)/(q - 2).$


We tabulate the solutions of 6.2 for specified values of q. Since $3 \leq q \leq 12$
(cf. Lemma 1 of § 3) when $p \geq q$, we have only 10 diophantine equations to
consider in order to list all the arithmetically possible maps of type {p, q}
($p \geq q$) and genus 3. The maps omitted (those for which p < q) are simply
the duals of maps already listed.

From 1.3 we see that pk must be even; hence any solution of 6.2 for which
pk is odd does not yield an arithmetically possible map. With this in mind, a
complete list of the arithmetically possible maps of type {p, q} ($p \geq q$) on
a surface of genus 3 is given by Table IV. The final column of the table
indicates the order which the group of the map must have if it happens to
be regular. The rows are numbered for easier reference.
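
The enumeration described above is small enough to repeat by machine. The Python sketch below is an added illustration (the function name `arithmetically_possible_maps` is hypothetical). It solves 6.1 for p over the range of q allowed by Theorem 8, uses the observation that 6.1 forces k to divide $-2\chi q$, and keeps only solutions with $p \geq q$ and pk even; each row is (p, q, $N_0$, $N_1$, $N_2$, g) with g = pk.

```python
def arithmetically_possible_maps(genus):
    """Rows (p, q, N0, N1, N2, g) for maps of type {p, q}, p >= q, of the given genus.

    Assumes genus >= 2, so that chi = 2 - 2*genus < 0 and Theorem 8 applies.
    """
    chi = 2 - 2 * genus
    rows = []
    for q in range(3, 2 * (2 - chi) + 1):        # Lemma 1 and Theorem 8
        # From 6.1: k * [p(q - 2) - 2q] = -2*chi*q, so k divides -2*chi*q.
        for k in range(1, -2 * chi * q + 1):
            numerator = q * (2 * k - 2 * chi)    # p = q(2k - 2chi) / (k(q - 2))
            denominator = k * (q - 2)
            if numerator % denominator:
                continue
            p = numerator // denominator
            if p >= q and (p * k) % 2 == 0:      # pk must be even (from 1.3)
                rows.append((p, q, p * k // q, p * k // 2, k, p * k))
    rows.sort(key=lambda row: (row[4], row[0]))  # by number of faces, then by p
    return rows

if __name__ == "__main__":
    for number, row in enumerate(arithmetically_possible_maps(3), start=1):
        print(number, row)
```

For genus 3 the sketch produces 21 rows, beginning with {12, 12} and ending with {7, 3}.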


7. The regular maps of genus 3. The problem now is to isolate the
regular maps which lie among the arithmetically possible maps in Table IV.
We determine first the regular maps $^1\{p, q\}$. Then, using the method of
Brahana (2, p. 280) we determine the regular maps $^2\{p, q\}$. We then note the
regular maps of genus 3 which occur in the tables of Coxeter mentioned
previously. Finally, the remaining possibilities in Table IV will be tested by
recourse to the results of §§ 3, 4, and 5, and by methods not unlike those
used there.










TABLE IV
THE ARITHMETICALLY POSSIBLE MAPS OF TYPE {p, q} ($p \geq q$) AND GENUS 3

No.   Type        $N_0$   $N_1$   $N_2$   g
1.    {12, 12}    1       6       1       12
2.    {14, 7}     2       7       1       14
3.    {20, 4}     5       10      1       20
4.    {30, 3}     10      15      1       30
5.    {8, 8}      2       8       2       16
6.    {9, 6}      3       9       2       18
7.    {10, 5}     4       10      2       20
8.    {12, 4}     6       12      2       24
9.    {18, 3}     12      18      2       36
10.   {14, 3}     14      21      3       42
11.   {6, 6}      4       12      4       24
12.   {8, 4}      8       16      4       32
13.   {12, 3}     16      24      4       48
14.   {6, 5}      6       15      5       30
15.   {10, 3}     20      30      6       60
16.   {5, 5}      8       20      8       40
17.   {6, 4}      12      24      8       48
18.   {9, 3}      24      36      8       72
19.   {8, 3}      32      48      12      96
20.   {5, 4}      20      40      16      80
21.   {7, 3}      56      84      24      168





Applying Theorem 1, we discover two 1-faced regular maps of genus 3,
namely $^1\{12, 12\}$ and $^1\{14, 7\}$, and exclude possibilities 3 and 4 in Table IV.
The regular maps $^1\{12, 12\}$ and $^1\{14, 7\}$ may be denoted by the symbols
$\{12, 12\}_{1,0}$ and $\{14, 7\}_2$ in analogy with the corresponding cases of regular
maps of genus 2 (7, p. 141). The group of $\{12, 12\}_{1,0}$ is the cyclic group of
order 12, while the group of $\{14, 7\}_2$ is the cyclic group of order 14.

In virtue of Lemmas 2 and 3 we may immediately rule out numbers 7
and 9 in Table IV as possibilities for regular maps. To determine whether
the remaining 2-faced maps are regular or not, we use the method initiated
by Brahana (2, p. 280), that is, given the 2-faced map of type {p, q}, we
look for a group generated by R and T = RS (cf. 3.1), with the defining
relations

$R^p = T^2 = E, \quad TRT = R^n$

where $n^2 \equiv 1 \pmod p$, and implying that RT is of the desired period, namely q.
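
The period condition in this method is easy to test by direct computation. The Python sketch below is an added illustration (the function name `two_faced_candidates` is hypothetical). It models the group with relations $R^p = T^2 = E$, $TRT = R^n$ as pairs (a, b) standing for $R^aT^b$, and returns the exponents n for which RT has period q. It checks the period of RT only; it does not by itself construct the map.

```python
def two_faced_candidates(p, q):
    """Exponents n with n*n = 1 (mod p) for which RT has period q.

    Assumption: elements are written R^a T^b, and since T R^c T = R^(n c),
    (R^a T^b)(R^c T^d) = R^(a + n^b c) T^(b + d).
    """
    good = []
    for n in range(1, p):
        if (n * n) % p != 1:
            continue

        def multiply(x, y):
            a, b = x
            c, d = y
            twist = n if b else 1
            return ((a + twist * c) % p, (b + d) % 2)

        rt = (1, 1)                 # the element RT
        power, period = rt, 1
        while power != (0, 0):      # multiply until the identity is reached
            power = multiply(power, rt)
            period += 1
        if period == q:
            good.append(n)
    return good

if __name__ == "__main__":
    for p, q in [(8, 8), (9, 6), (12, 4)]:   # cases 5, 6 and 8 of Table IV
        print((p, q), two_faced_candidates(p, q))
```

Since $(RT)^2 = R^{n+1}$ in this group, the period of RT is always even, which agrees with the exclusion of cases 7 and 9.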

In case no. 5 we have p = 8, and hence

7.1    $n^2 \equiv 1 \pmod 8.$

Solutions are n = 1, 3, 5, and 7. If n = 1, then RT = TR and RT is of
period 8. Thus there exists a regular map of type {8, 8} and genus 3 whose
group is defined by the relations

$R^8 = T^2 = E, \quad R \rightleftarrows T.$

The map is analogous to the regular map $\{6, 6\}_2$ of genus 2 (7, p. 141);
accordingly we denote it by the symbol $\{8, 8\}_2$. The solution n = 3 of 7.1 gives no
further regular map of type {8, 8}, nor does the solution n = 7. But when
n = 5, RT is again of period 8 and hence there exists another regular map
of type {8, 8} and genus 3, whose group has the abstract definition

$R^8 = T^2 = E, \quad TRT = R^5.$

It was shown by Coxeter and Moser (7, p. 114) that this abstract definition
may be put in the form

7.2    $T^2 = E, \quad TST = S^{-3}$

and that the above relations define Miller's group (2, 2 | 2). Accordingly,
the map is denoted by the symbol {4·2, 4·2} (cf. Table II). This is the "map
of type {8, 8}" mentioned by Coxeter and Moser (7, p. 114), a member of
the sub-family of regular maps {(r + 1)·(r − 1), (r + 1)·2} on a surface of
genus $\tfrac{1}{2}r(r - 1)$.

Proceeding in the manner outlined above, we eliminate case 6 in Table IV,
but discover corresponding to case 8 a regular map of type {12, 4}. Its group
has the abstract definition

$R^{12} = T^2 = E, \quad TRT = R^5.$

This is the group (6, 2 | 2; 2) (7, p. 114), and therefore the map is denoted by
the symbol {2·6, 2·2}. It is a member of the sub-family of maps {2·2p, 2·2},
to which Coxeter and Moser give the symbol $\{4p, 4\}_{1,1}$ (7, p. 115). Another
symbol for the group (6, 2 | 2; 2) is $\mathfrak{C}_4 \times \mathfrak{D}_3$ (cf. 7, p. 10, (1.861) when
y = 5, m = 3, n = 2).

An important family of regular maps is the family whose members are
characterized by specified Petrie polygons. A Petrie polygon of a map is a
"zig-zag" along its edges such that every two but no three successive edges
of the polygon are edges of a single face. For example the path ABCDEF...
of Figure 4 is a Petrie polygon. A regular map of type {p, q} characterized
by its r-gonal Petrie polygons is denoted by the symbol $\{p, q\}_r$. If the map
is on an orientable surface, then r is even (7, p. 111) and the group of the
map has the abstract definition

7.3    $R^p = S^q = (RS)^2 = (R^2S^2)^n = E$

where $n = \tfrac{1}{2}r$ (4, p. 126). The dual of $\{p, q\}_r$ also has r-gonal Petrie polygons
and is denoted by the symbol $\{q, p\}_r$.

The previous use of the symbols $\{14, 7\}_2$ and $\{8, 8\}_2$ is easily shown to
be justified. In addition to these two cases of regular maps $\{p, q\}_{2n}$ of genus 3,
the tables of Coxeter and Moser (7, p. 140) contain $\{8, 3\}_6$ and $\{7, 3\}_8$,
corresponding to entries 19 and 21 in Table IV. We thus establish the existence
of two more regular maps of genus 3. The map $\{7, 3\}_8$ was discussed in 1879
by Klein (10); Dyck (8) examined $\{8, 3\}_6$ in 1880.










FIGURE 4 


The relations 7.3 form an abstract definition for the group (2, q, p; n)
(4, p. 86). Thus in particular (2, 3, 8; 3) is the group of $\{8, 3\}_6$ and (2, 3, 7; 4)
is the group of $\{7, 3\}_8$. The latter is the simple group LF(2, 7) (7, p. 96).

We have not yet shown that $\{8, 3\}_6$ and $\{7, 3\}_8$ are the only regular maps
of types {8, 3} and {7, 3} on this surface. We shall postpone the proof until
we are ready to make a systematic study of all the regular maps of type
{p, 3} and genus 3.

We now seek regular maps of genus 3 among the regular maps having
specified holes. A hole is a path along the edges of a map such that at each
vertex visited we leave two faces on (say) the left (3, p. 38). Thus, for
example, the path ACDGHIA in Figure 4 is a hexagonal hole. A regular
map of type {p, q} characterized by its m-gonal holes is denoted by the
symbol {p, q | m} and its group, denoted by (p, q | 2, m), has the abstract
definition

7.4    $R^p = S^q = (RS)^2 = (R^{-1}S)^m = E$

(4, p. 74). If m = 2, then p and q are even (7, p. 109). Suppose that m = 2,
p = 4, and q is any even number. Then the final relation in 7.4 is

$(R^{-1}S)^2 = E.$

Since $R^4 = (RS)^2 = E$, this relation implies

$R^2SR^2 = RS^{-1}R = S,$

whence

$R^2 \rightleftarrows S.$




























Thus

(4, q | 2, 2) = ((2, q | 2; 2))

and

{4, q | 2} = {2·2, q}

(cf. Table I). Dually

{q, 4 | 2} = {q, 2·2}.

Consulting Coxeter and Moser (7, p. 109), we discover the regular map
{8, 4 | 2} = {8, 2·2}, of genus 3, which occurs in Table IV as case 12.

Having found one regular map of type {8, 4} and genus 3, we ask if there
are any others. To answer this question we exhibit in Figure 5 a diagram
of the arrangement of the four faces α, β, γ, and δ of any regular map $^4\{8, 4\}$,
this arrangement being the only one possible because of Lemmas 2 and 3
of § 3. Again by Lemmas 2 and 3 the face ε of the regular tessellation {8, 4},
which must now be identified with one of the former 4 faces, cannot be
identified with α or δ. Applying Lemma 5 to the face α, we see that ε cannot
be identified with γ. Thus ε must be identified with β; in symbols

$\{R\}SR^2 = \{R\}S.$


FIGURE 5













In particular for some integer l,

$SR^2 = R^lS.$

Since $R^l$ is of the same period as $R^2$, $l = \pm 2$. If l = −2, the group of the
map must satisfy the following relations:

7.5    $R^8 = S^4 = (RS)^2 = E, \quad SR^2 = R^{-2}S.$

The final relation rewritten is

$S^{-1}R(RSR)R = E,$
$S^{-1}RS^{-1}R = E$ (since $(RS)^2 = E$).

Thus 7.5 is identical with 7.4 when p = 8, q = 4, and m = 2. We have,
therefore, no new regular map for the case l = −2. When l = 2, the group
of the map must satisfy the relations

$R^8 = S^4 = (RS)^2 = E, \quad R^2 \rightleftarrows S,$

which define the group ((2, 4 | 2; 4)) = ((2, 4 | 2)). Thus the above choice of
l yields a second regular map of type {8, 4} and genus 3, which is denoted
by the symbol {4·2, 4} (cf. Table I).


In an extension of his concept of a hole, Coxeter (3, p. 59) introduced the
notion of a second hole. This is a path along the edges of a map such that at
each vertex visited we leave three faces on (say) the left. Thus, for example,
the path ACEJ ... of Figure 4 is a second hole. A regular map of type {p, q}
characterized by its m-gonal second holes is denoted by the symbol {p, q |, m},
and its group has the abstract definition

7.6    $R^p = S^q = (RS)^2 = (RS^{-2})^m = E.$

Coxeter (3, p. 61) compiled a list of regular maps {p, q |, m}. There are three
unfortunate omissions in this table, which were later corrected by Coxeter.
They are

{4, 6 |, 2}      12      24      8      3      $\mathfrak{S}_4 \times \mathfrak{C}_2$     48
{5, 6 |, 2}      24      60      20     9      $\mathfrak{A}_5 \times \mathfrak{C}_2$     120
{3, 11 |, 4}     2024    3036    552    231    LF(2, 23)                                  6072

In the complete table there are three regular maps of genus 3, namely {3, 8 |, 3},
{3, 7 |, 4}, and {4, 6 |, 2}, corresponding to entries 19, 21, and 17 in Table
IV. The group of {3, 8 |, 3} has the abstract definition 7.6 with p = 3, q = 8,
and m = 3 while the group of {3, 7 |, 4} is 7.6 with p = 3, q = 7, and m = 4.
It is easily seen by comparing 7.6 with 7.3 that any map {3, q |, m} is identical
with the map $\{3, q\}_{2m}$. Thus {3, 8 |, 3} is $\{3, 8\}_6$ and {3, 7 |, 4} is $\{3, 7\}_8$.
Because of the ease with which it may be dualized, the latter symbol in each
case is used exclusively to denote the map.

The group of the regular map {4, 6 |, 2} has the abstract definition

$R^4 = S^6 = (RS)^2 = (RS^{-2})^2 = E.$




It is easily shown that the above relations are equivalent to

7.7    $R^4 = S^6 = (RS)^2 = E, \quad R \rightleftarrows S^3.$

But these relations define the group ((2, 4 | 3; 2)) and therefore {4, 6 |, 2} will
be denoted by the symbol {4, 2·3}, which dualizes more easily.

We wish to determine whether or not there are any other regular maps
of type {6, 4} and genus 3. To this end we exhibit in Figure 6 a part of the
regular tessellation {6, 4} in the hyperbolic plane.


FIGURE 6


There are three possible arrangements of the faces of a regular map around
the face α, namely

(i) only two distinct faces are contiguous with α,
(ii) only three distinct faces are contiguous with α,
(iii) six distinct faces are contiguous with α.

Case (i) is easily dispensed with, for if there were such a regular map, we
would have the relation

$\{R\}S = \{R\}SR^2,$

which implies, for some integer l, the relation

$R^lS = SR^2.$










When this is added to

7.8    $R^6 = S^4 = (RS)^2 = E$

it is not hard to show that there are only 4 right cosets of the subgroup {R}.
This eliminates the possibility of an 8-faced map. Case (iii) is likewise easily
dispensed with, for it is readily seen that the face δ (Figure 6) must be one
of the six faces which surround α. Taking into account the symmetry of the
map, this implies

$\{R\}S^2 = \{R\}SR^{-j}$    (j = 1 or 2),

which in turn implies either

$R^lS^2 = SR^{-1}$

or

$R^mS^2 = SR^{-2}$

for some integers l and m. But if we add either one of these relations to 7.8
and enumerate cosets of {R}, we obtain a collapse which implies a breakdown
of the proposed structure of the map. Thus case (ii) is the only possible
arrangement of faces surrounding α that can yield a regular map. In this
case γ and δ must be identical; in symbols

$\{R\}S = \{R\}SR^3.$

This implies in particular that

$R^lS = SR^3$

for some integer l. If the period of R is to remain at 6, then l = 3 and the
only regular map that this case yields is the map whose group has the abstract
definition 7.7, that is, the group ((2, 4 | 3; 2)). Therefore {2·3, 4} is the only
regular map of type {6, 4} and genus 3.

We shall now consider the maps $^k\{p, 3\}$ with $k \geq 3$ in the order in which
they appear in Table IV, applying the results of §§ 3 and 5.

By Theorem 3, case no. 10 is eliminated; there is no regular map $^3\{14, 3\}$.
There is, however, by the result of Theorem 4, exactly one regular map of
type {12, 3} and genus 3. It is the regular map {4·3, 3}, and corresponds to
case 13 in Table IV.

Theorem 6 eliminates case 15 as a possibility for a regular map.

Applying Theorem 7 and Lemma 5 to case 18, we see that if the number
of faces of a regular map of type {9, 3} is > 6 it must be $\geq 10$. But the
arithmetically possible map of type {9, 3} and genus 3 has only 8 faces and therefore
cannot be regular.

Having already found one regular map of type {8, 3} and genus 3, namely
$\{8, 3\}_6$, we now check the possibility of others. In view of Theorem 7 and
Lemma 5 we know that a face α of a regular map $^{12}\{8, 3\}$ must be surrounded
by 8 other distinct faces. Now either the face β of the tessellation {8, 3}
(cf. Figure 7) is one of these 8 faces, or else it is one of the 3 remaining faces.
If the former alternative is true we have, taking into account the symmetry
of the map,

$\{R\}SR^{-1}S = \{R\}SR^{-j}$    (j = 2 or 3).

In particular, we have for some integers l and m,

$R^lSR^{-1}S = SR^{-2}$

or else

$R^mSR^{-1}S = SR^{-3}.$

Adding each of these in turn to the relations

7.9    $R^8 = S^3 = (RS)^2 = E$

and enumerating cosets of {R}, we prove in both cases that the structure
of the faces surrounding α is broken down by the proposed identification
of β. Hence β must be a tenth face of the map. If β is not one of the faces
surrounding α then neither is γ. Moreover γ is not identical to β since both
β and γ border on one and the same face. Thus γ is an eleventh face. However,
Lemma 5 implies that δ must be identical with β; hence

$\{R\}SR^{-1}S = \{R\}SR^{-1}SR^2.$

Thus, for some integer l,

$R^lSR^{-1}S = SR^{-1}SR^2.$

If we add this relation to 7.9 and enumerate cosets of {R} in the group thus
defined, we obtain 12 cosets and the correct group structure if, and only if,
l = 2. The relation $R^2SR^{-1}S = SR^{-1}SR^2$ rewritten is

$R^2SR^{-1}S^{-1} = SR^{-1}SRRS.$

Since $(RS)^2 = E$, this relation becomes

$R^2SSR = SR^{-2}S^{-1}S^{-1}R^{-1},$
$(R^2S^{-1})^3 = E$ (since $S^3 = E$).

This is the fourth defining relation of the group of the regular map $\{8, 3\}_6$
(cf. 7.3 when p = 8, q = 3, n = 3). Hence $\{8, 3\}_6$ is the only regular map
of type {8, 3} and genus 3.


FIGURE 7

In a similar fashion we may prove that {7,3}, is the only regular map 
of type {7,3} and genus 3. The method is now apparent; we build up the 
structure of the map step by step, using the theory of §§ 3 and 5, and testing 
each step by examining its effect on the group of the regular tessellation {7, 3}. 

The only entries in Table IV which remain to be considered are 11, 14, 16, 
and 20. In case 11 we seek a 4-faced regular map of type {6, 6}. In virtue 
of Lemmas 2, 3, and 5 of § 3, and the fact that the map has only 4 faces, it 
follows that three distinct faces surround a vertex, and their arrangement 
must be like that of $\alpha$, $\beta$, and $\gamma$ in Figure 8. Thus we have the relation

$\{R\}S^3 = \{R\}.$
In particular there is an integer $l$ such that
$S^3 = R^{l}.$
Since S and R are both of period 6, $l = 3$. Adding the relation $S^3 = R^3$ to
$R^6 = S^6 = (RS)^2 = E$
we note that the relations may be put into the form
$R^3 = S^3 = Z, \quad (RS)^2 = Z^2 = E,$

which defines the group $(3, 3 \mid 2; 2)$. This is the group of the regular map
$\{2 \cdot 3, 2 \cdot 3\}$, isomorphic to the group $\mathfrak{A}_4 \times \mathfrak{C}_2$ (7, p. 73). The map $\{2 \cdot 3, 2 \cdot 3\}$
is the only regular map of type {6, 6} and genus 3.
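The relations of case 11 can be tried out on explicit permutations. The sketch below is an illustration under stated assumptions, not the author's method: the two permutations R and S on six points are chosen here by hand (a 3-cycle on the points 0 to 3 combined with the transposition of 4 and 5), and the helper names are hypothetical. It checks that these generators satisfy the relations above and generate a group of order 24, which is consistent with the alternating group on four points times a group of order 2. It does not by itself prove that the abstract relations define a group of exactly that order.

```python
def compose(p, q):
    """Permutation 'apply q first, then p' on the points 0..n-1."""
    return tuple(p[i] for i in q)


def power(p, k):
    """k-fold product of the permutation p with itself."""
    result = tuple(range(len(p)))
    for _ in range(k):
        result = compose(p, result)
    return result


def generate(gens):
    """Closure of the generators under multiplication (a finite group)."""
    identity = tuple(range(len(gens[0])))
    seen = {identity}
    frontier = [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(g, s)
            if h not in seen:
                seen.add(h)
                frontier.append(h)
    return seen


# Hand-picked generators (an assumption of this sketch, not from the paper):
# R = (0 1 2)(4 5) and S = (0 1 3)(4 5) in cycle notation.
R = (1, 2, 0, 3, 5, 4)
S = (1, 3, 2, 0, 5, 4)
E = tuple(range(6))
Z = power(R, 3)

relations_hold = (
    power(S, 3) == Z                 # R^3 = S^3 = Z
    and power(compose(R, S), 2) == E  # (RS)^2 = E
    and power(Z, 2) == E              # Z^2 = E
    and Z != E
)
group = generate([R, S])
print(relations_hold, len(group))  # True 24
```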

The remaining three cases, 14, 16, and 20 in Table IV, yield no regular
maps. As in previous cases this fact may be verified by assuming in each
case that such a map exists, and then attempting to find its group by
identifying faces of the regular tessellation from which the map would arise.
Every identification that is possible proves to be unfruitful. With the help
of the lemmas of § 3, this procedure is neither difficult nor unduly long.

FIGURE 8

TABLE V
THE REGULAR MAPS OF GENUS 3

Map                            $N_0$  $N_1$  $N_2$  Dual                           Group                                                               Order
$\{12, 12\}_{1,0}$             1      6      1      Self-dual                      $\mathfrak{C}_{12}$                                                 12
$\{14, 7\}_{\ldots}$           1      7      2      $\{7, 14\}_{\ldots}$           $\mathfrak{C}_{14}$                                                 14
$\{8, 8\}_{\ldots}$            2      8      2      Self-dual                      $\mathfrak{C}_8 \times \mathfrak{C}_2$                              16
$\{4 \cdot 2, 4 \cdot 2\}$     2      8      2      Self-dual                      $(2, 2 \mid 2)$                                                     16
$\{2 \cdot 6, 2 \cdot 2\}$     2      12     6      $\{2 \cdot 2, 2 \cdot 6\}$     $(6, 2 \mid 2; 2) \cong \mathfrak{C}_4 \times \mathfrak{D}_3$       24
$\{2 \cdot 3, 2 \cdot 3\}$     4      12     4      Self-dual                      $(3, 3 \mid 2; 2) \cong \mathfrak{A}_4 \times \mathfrak{C}_2$       24
$\{8, 2 \cdot 2\}$             4      16     8      $\{2 \cdot 2, 8\}$             $((2, 8 \mid 2; 2)) \cong (8, 4 \mid 2, 2)$                         32
$\{4 \cdot 2, 4\}$             4      16     8      $\{4, 4 \cdot 2\}$             $((2, 4 \mid 2))$                                                   32
$\{4 \cdot 3, 3\}$             4      24     16     $\{3, 4 \cdot 3\}$             $((2, 3 \mid 3))$                                                   48
$\{2 \cdot 3, 4\}$             8      24     12     $\{4, 2 \cdot 3\}$             $((2, 4 \mid 3; 2)) \cong \mathfrak{S}_4 \times \mathfrak{C}_2$     48
$\{8, 3\}_6$                   12     48     32     $\{3, 8\}_6$                   $(2, 3, 8; 3)$                                                      96
$\{7, 3\}_8$                   24     84     56     $\{3, 7\}_8$                   $(2, 3, 7; 4) \cong LF(2, 7)$                                       168


Reference column of Table V (entries in printed order): (7, p. 61); (7, p. 140); (7, p. 140); (7, p. 114); (7, p. 115); (7, p. 109); (…, p. 115); (…, p. 140); (…, p. 278); (7, p. 140).













The 12 regular maps of genus 3 (a map and its dual counted as one) are 
listed in Table V. Figures 9-20 are drawings of the maps in which the 
edges are numbered. Those bordering edges which are numbered alike are 
to be identified. 
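The numerical columns of Table V admit two simple consistency checks, sketched below as an illustration rather than as part of the paper. Each row should satisfy Euler's formula for genus 3, that is, $N_0 - N_1 + N_2 = -4$, and the order of the group generated by R and S should be twice the number of edges. The map labels are plain-text stand-ins for the symbols of the table. Where the scan is hard to read, the counts are taken as $N_1 = 8$ for {4·2, 4·2} and $N_0 = 8$, $N_1 = 24$ for {2·3, 4}, which are the readings consistent with the Order column; this is an assumption of the sketch.

```python
# Rows of Table V as (label, N0, N1, N2, order). Labels are plain-text
# stand-ins; the counts for "{4.2,4.2}" and "{2.3,4}" are assumed readings.
TABLE_V = [
    ("{12,12}", 1, 6, 1, 12),
    ("{14,7}", 1, 7, 2, 14),
    ("{8,8}", 2, 8, 2, 16),
    ("{4.2,4.2}", 2, 8, 2, 16),
    ("{2.6,2.2}", 2, 12, 6, 24),
    ("{2.3,2.3}", 4, 12, 4, 24),
    ("{8,2.2}", 4, 16, 8, 32),
    ("{4.2,4}", 4, 16, 8, 32),
    ("{4.3,3}", 4, 24, 16, 48),
    ("{2.3,4}", 8, 24, 12, 48),
    ("{8,3}", 12, 48, 32, 96),
    ("{7,3}", 24, 84, 56, 168),
]
GENUS = 3


def check_row(n0, n1, n2, order, genus=GENUS):
    """True if the row satisfies Euler's formula and order = 2 * edges."""
    euler_ok = n0 - n1 + n2 == 2 - 2 * genus
    order_ok = order == 2 * n1
    return euler_ok and order_ok


failures = [row[0] for row in TABLE_V if not check_row(*row[1:])]
print(len(TABLE_V), failures)  # 12 []
```

Both checks are symmetric in $N_0$ and $N_2$, so they do not depend on which of the two columns counts the faces.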





Figure 13: $\{2 \cdot 6, 2 \cdot 2\}$        Figure 14: $\{2 \cdot 3, 2 \cdot 3\}$











Figure 15: $\{8, 2 \cdot 2\}$        Figure 16: $\{4 \cdot 2, 4\}$














Figure 17: $\{3, 4 \cdot 3\}$











Figure 18: $\{4, 2 \cdot 3\}$














Figure 19: $\{3, 8\}_6$






















Figure 20: $\{3, 7\}_8$







REFERENCES 


1. H. R. Brahana, Regular maps on an anchor ring, Amer. J. Math., 48 (1926), 225-40.
2. H. R. Brahana, Regular maps and their groups, Amer. J. Math., 49 (1927), 268-84.
3. H. S. M. Coxeter, Regular skew polyhedra in three and four dimensions and their topological
analogues, Proc. London Math. Soc. (2), 43 (1937), 33-62.
4. H. S. M. Coxeter, The abstract groups $G^{m,n,p}$, Trans. Amer. Math. Soc., 45 (1939), 73-150.
5. H. S. M. Coxeter, Regular polytopes (London, 1948).
6. H. S. M. Coxeter, Configurations and maps, Reports of a Math. Colloq. (2), 8 (1948), 18-38.
7. H. S. M. Coxeter and W. O. J. Moser, Generators and relations for discrete groups, Ergebn.
Math., 14 (1957).
8. W. Dyck, Notiz ueber eine reguläre Riemann'sche Fläche vom geschlechte drei und die zugehörige
"normalcurve" vierter ordnung, Mat. Ann., 17 (1880), 510-16.
9. R. Frucht, A one-regular graph of degree three, Can. J. Math., 4 (1952), 240-7.
10. F. Klein, Ueber die transformationen siebenter ordnung der elliptischen functionen, Mat.
Ann., 14 (1879), 428-71.
11. F. Klein, Lectures on the icosahedron and the solution of equations of the fifth degree, trans.
G. C. Morrice (London, 1913).
12. G. C. Shephard, Regular complex polytopes, Proc. London Math. Soc. (3), 2 (1952), 82-97.


University of Toronto 











To be published this Spring 


DIFFERENTIAL GEOMETRY 
By ERWIN KREYSZIG 


This book is intended to meet the need for a text introducing advanced 
students in mathematics, physics, and engineering to the field of differential 
geometry. It is self-contained, requiring only a knowledge of the calculus. 
The material is presented in a simple and understandable but rigorous
manner, accompanied by many examples which illustrate the ideas, 
methods, and results. The use of tensors is explained in detail, not omitting 
little formal tricks which are useful in their applications. Though never 
formalistic, it provides an introduction to Riemannian geometry. 

The theory of curves and surfaces in three-dimensional Euclidean 
space is presented in a modern way, and applied to various classes of 
curves and surfaces which are of practical interest in mathematics and its 
applications to physical, cartographical, and engineering problems. Con- 
siderable space is given to explaining and illustrating basic concepts such 
as curve, arc length, surface, fundamental forms; covariant and contra- 
variant vectors; covariant, contravariant and mixed tensors, etc. 

Interesting problems are included and complete solutions are given 
at the end of the book, together with a list of the more important formulae. 
No pains have been spared in constructing suitable figures. 

Erwin Kreyszig is Professor, Department of Mathematics, Ohio State
University. 

xxi + 356 pages 6 x 9 inches $8.50 


PROCEEDINGS OF THE FOURTH 
CANADIAN MATHEMATICAL 
CONGRESS 


Edited by M. S. Macphail 


Included in this volume are minutes of the Summer Seminar and the 
Quadrennial Congress held in 1957; texts of the panel discussions on high 
school mathematics and on mathematics in the university and in industry; 
abstracts of contributed papers and texts of invited lectures. 

viii + 184 pages 6 x 9 inches $6.00 


UNIVERSITY OF TORONTO PRESS 








